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@ A hybrid polypeptide Is disclosed comprising 
an avidin-binding polypeptide containing a 
biotin binding domain fused to a polypeptide of 
interest wherein the avidin-binding polypeptide 
is "upstream" of the polypeptide of interest 
The hybrid polypeptide is produced by recom- 
binant DNA techniques. The hybrid polypeptide 
may also contain a cleavage site for cleaving the 
polypeptide of interest from the avidin-binding 
polypeptide by using an appropriate proteolytic 
or chemical reagent The hybrid polypeptide is 
expressed in appropriate host cells transformed 
with the DNA expression vector encoding the 
hybrid polypeptide, and nnay be recovered from 
crude cell extracts in high yield and high purity 
using avidin afHnity chromatography. Following 
avidin afFinity purification, the polypeptide for 
attachment and polypeptide of interest may be 
cleaved to yield polypeptide of interest in a 
highly pure and highly active state. 
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The present invention relates to a hybrid polypeptide. 

In particular, the present invention relates to a recombinant hybrid polypeptide comprising a polypeptide 
of interest fused to an avidin-binding polypeptide. The avid in-binding polypeptide contains a biotin attachnnent 
domain. 

5 Mora particulariy, the present invention relates to a hybrid polypeptide comprising a polypeptide of Interest 

fused to a biotinylated polypeptide that can bind to avldin. 

The present invention also relates to a nucleic acid sequence that encodes for the hybrid polypeptide, a 
process for producing the same and a process for recovering same. 
In particular, the nucleic acid sequence is a DNA expression vector. 
10 Generally, the synthesis of comrr>ercially important peptides and proteins has been limited by high produc- 
tion and purification costs and, also, poor product recovery. Until recently, animals, micro-organisnDS, plants, 
cadavers, serum, and urine have been the only sources from which bioactive polypeptides could be purified. 

However, advances In recombinant DNA technology have made the biological synthesis of valuable poly- 
peptides possible and in commercial quantities. In this regard, recombinant DNA molecules directing the syn- 
15 thesis of convnerclally useful polypeptides can be introduced into procaryotic or eucaryotic expression systems. 
For example, recombinant DNA technology has enabled human growth hormone production by recombinant 
bacteria and, today, fermentation replaces the traditional source. 

Todate, biological synthesis is the only practical approach to the commercial-scale synthesis of peptides 
of greater than 20 amino acid residues. Once synthesized, the desired polypeptide product must t>e purified 
20 from a complex mbcture of cellular components. The degree of purification depends upon the Intended appli- 
cation of the polypeptide. The cost of purification can account for up to 70% of the cost of production, as sub- 
stantial losses of active ingredient usually occur during multistep purification processes. 

Polypeptide purifications are usually achieved through one or more processes which are based upon phys- 
ical properties of the polypeptide of interest. For example, proteins may be separated on the basis of solubility, 
25 size, ionic properties or affinity for specific ligands; usually several of these techniques are required to achieve 
acceptable purity. 

Affinity resin chromatography can greatty reduce the number of purification steps required to achieve the 
desired level of purity. AfHnity purification is based upon a specific binding interaction between a polypeptide 
to be purified and a ligand which is usually attached to a solid support As used herein, the polypeptide binds 

30 to the ligand by virtue of a pmsthetic group bound to an attachment donr^in present on the polypeptide. When 
a complex mixture such as a cell extract or crude mixture of synthetic peptides is passed over an affinity resin, 
the polypeptide to be purified is selectively retained by the resin and all molecules lacking the prosthetic group 
on the attachment domain are washed away from the resin. 

Therefore, in a single step, the polypeptide of interest may be recovered in high purity. 

35 In order to use affinity chromatography to advantage for polypeptide purification, recombinant DNA tech- 

nology can be used to construct chimeric gene fusions for recombinant hybrid polypeptides which in bacterial 
host cells incorporate the following elements: a 5' promoter, DNA coding for a polypeptide of interest; DNA cod- 
ing for a polypeptide that contains a ligand binding donrmin; and optionally ribosomal terminators, such as the 
rmB terminators found on the E. coli expression vector pkk223-3 (Brosius, j. and Holy A., Proc Nat Acad Sci 

40 USA 81:6929-6933 (1984); Brosius, j. et al Plasmid 6:1 12-1 18 (1984)). 

Suitable promoters are those which nrtaximize expression of tiie desired gene in tiie host cell, and factors 
to be considered in promoter construction are discussed by Old and Primrose in Chapter 7 of Principles of Gene 
Manipulation 3rd Edition (Blackwell Scientific Publications, Palo Alto CA 1985). Examples of bacterial promot- 
ers appropriate for expression of cloned genes include the PL, tac, lac, and trp pronrtoters (ibid). 

45 The DNA used to construct chimeric gene fusions can be obtained from organisms or can be novel synthetic 

DNA firagnrwnts, or combinatbns thereof. The DNA sequences are assembled into a chinr^ric gene, which is 
inserted into a DNA expression vector in such a nnanner that in the appropriate host organism, the polypeptide 
of interest and ttie polypeptide for attachment to the affinity resin are produced as a single polypeptide chain. 
Other systenns for affinity purification of hybrid recombinant polypeptides are known. Klowever, significant 

50 technical obstacles limit their use for commercial-scale polypeptide purification. For example, chimeric genes 
encoding polypeptides containing a polyarginine C-terminal tail (Sassenfeld and Brewer, Biotechnology 2:76- 
81 (1984)) or pdyhistidine domain (Smith et al. J. Biol Chem 263:721 1 (1988)) can facilitate separation by ion 
exchange or metal chelate ion chromatography. Such systems are not broadly applicable because affinity in- 
teraction depends upon physical properties of the fusion polypeptide (changeability to chelate metals), and it 

55 is not always possible to achieve sufficient change in these physical properties to permit affinity binding. 

Another type of affinity chromatography is immunoaffinity chronr^tography, wherein polypeptides of interest 
are fused to immunogenic proteins such as E. coli beta-galactosidase (Ruther and Muller-Hill, EMBO J 2: 1 791 - 
1794 (1983)) or small hydrophilic peptides (Hopp etal., US 4,703,004 (1988)) to achieve purification. Polypep- 
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tides fused to staphylococcal protein A can be punfied using IgG-Sepharose (Nilsson et al. EMBO J. 4:1075- 
1080 (1985). Lowenadler et al. EMBO J. 5:2393-2398 (1986)). Polypeptides lused to Protein G can t>e isolated 
using albumin as ttie inrvnobilized ligand (Nygren et al. J. Md. Recognition. 1:69 (1988)). 

A critical disadvantage limiting ttie usefulness of these prior art methods is that extreme conditions, indud- 
5 ing the use of denaturants, are necessary to renwve the fusion proteins from the affinity resin, which may de- 
stroy biological activity if native folding cannot be achieved. Low product recovery rates can also limit the use- 
fulness of such systenDS. 

Affinity based upon the binding of small nr>olecules by a large protein is known as substrate-affinity chro- 
nruitography. A small nDolecule. a ligand, forms a complex with a specific ligate. Examples of ligand:ligate com- 

10 binations indude avidinibbtin (Green in Advances in Protein Chemistry Vd 29 pp 85-133 Anson et al.. 
Eds.(1975)). streptavidinibiotin (PCT/US85/01901, Meade and Garvin (1985)), lipoic acid:avidin (Green ibid), 
chloramphenicd acetyl transferase:acetyl CoA(EPO 01 31363, Bennetetal. (1984)), beta-galactosidase: para- 
aminophenyl-beta-D-thio-galactoside (Offensberger et al. Proc. Natl Acad. Sci USA 82:7540-7544 (1985)), 
phosphate binding protein: hydroxyapatite (Anba et al. Gene 53:219 (1987)). maltose binding protein:starch 

15 (EPO 286239, Guan et al. 1 988), and glutathione S-bBnsferase:glutathione (Smith and Johnson Gene 67:21- 
30 (1988)). 

In recent years, the unique properties of the prosthetic group biotin and its exceptionally high affinity (1015 
M-1) and specificity for the proteins avidin and streptavidin (Green ibid.) have been exploited to devise powerful 
and widely applicable tods for microbiology, biochemistry and medical science (Wilchek and Bayer Analyt Bio- 

20 chem 171:1-32 (1988), Bayer and Wilchek Methods in Biochem Anal 26:1-45 (1980)). 

Bbtin is a prosthetic group found on only a few protein spedes (Ann N.Y. Acad. Sci 447:1-441 , Dakshina- 
murti and Bhagavan, Eds. (1985)). Attachment in vivo is mediated by bk)tin hobenzyme synthetases which 
recognizes a highly conserved attachment domain and catalyzes the covalent attachment of biotin to that do- 
main (Wood et al, J Biol Chem 225:7397-7409 (1980); Shenoy and Wood, FASGB S 2:2396-2401 (1988)). 

25 Experiments using recombinant DNA technology have shown that biotin holoenzyme synthetases will bio- 

tlnylate heterologous pdypeptides containing this conserved attachnDent domain. For example, the 1.3S sut>- 
unit of the enzyme transcartx)xylase from Propionibacterium, which contains the conserved sequence, when 
doned and expressed in E. cdi is biotinylated by the E. coli synthetase (Murtif et al. Proc Nat Acad Sd USA 
82:5617-5621 (1985)). 

30 polypeptkJe or part of ajKilypeptide oontaining ttie conserved bk>tin attachment donnain, such as erUire 

1.3S (SEQ ID N0 :1) proteip Q Lthe bj ogrh binding recognit ion sequence identified within the 1.3S protein from 
Proptoniba^ri um, (S EQ ID NO:2) can be incorporated into a hybrid recoml3inanrpolypeptb^^^^ a hybrid 
polype ptkie containing a biotin attachment domain fused to one or more polypeptides of interest could be used 
to achieve the separatron of virtually any recombinant protein t>ased upon the affinity of the ligand avidin for 

35 the ligate bk)tin. 

Avidin: biotin chromatography shares advantages generally applicable to substrate affinity chromatography 
systems for commercial-scale polypeptide purification. Substrate-affinity resins are generally inexpensive. Fu- 
sion proteins can be recovered using mild conditions by elution with free ligand. 

Post-translational addition of the biotin prosthetic group is independent of the final fdded state of the protein 

40 (Wood et al. J Bid Chem 255:7397-7409 (1980)), an advantage when the host cell perfonns no post- transla- 
tional modifications on the recombinant polypeptkle. 

A ligand domain such as the donnain directing biotin attachment would therefore t>e particularly advanta- 
geous for recovery of flisbn proteins found in inclusion t>odies or for recovery of insduble proteins which require 
denaturants or zwitterfonb detergents for solubilization during extraction, prior to affinity chromatography. 

45 PCT WO 90/14431, which names Cronan as an inventor, disdoses a hybrid DNA sequence encoding a 
fusion protein comprising a first DNA sequence which encodes an amino acid sequence that allows for post- 
translatk>n nnodrfication of the fusion protein; and a second DNA sequence joined end to end with the first DNA 
sequence and in the same reading frame, the second DNA sequence encoding a selected protein or polypep- 
tide. In each of the examples tiie first DNA sequence is fused to the 3' end of the second DNA sequence (i.e. 

50 the first DNA sequence is downstream of the second DNA sequence). 

Also disdosed disdosed in PCT WO 90 / 14431 is a vector comprising the hybrid DNA, a host transfonned 
with the vedor, a method of producing a fusk>n protein by culturing the transfonmed host under conditions per- 
mitting expressk>n of the fusion protein, a fusion protein comprising a sdected protein or polypeptide linked to 
an amino add sequence that allows for post-translation modification of the fusion protein and a method of iso- 

55 lating the fusion protein comprising providing a binding partner that binds to the fusbn protein only after it has 
t>een modified, contacting the modified fusk>n protein with the binding partner under conditions permitting bind- 
ing, separating the rTK>drfied fusion protein bound to the binding partner from unbound n^aterials in the mixture, 
and eluting the modified fusion protein. 
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The work of Cronan is also disclosed in J Biol Chem 265:10327-10333 (1990) (Cronan) wherein a recom- 
binant DNA plasmid from E. coli (Murtif et al. Proc Nat Acad Sci USA 82:5617-5621 (1985)) was used to con- 
struct fusion genes containing segments of the 1.3S gene, which contain the biotin attachment donnain. 

Cronan (ibid) demonstrated that 1.3S sequences can be used to specifically label proteins in vivo, and to 
5 purify proteins from crude cell lysates by avid in affinity chromatography. 

As in each of the examples of PCT WO 90/14431 , Cronan's (ibid) chimeric genes were constructed by fus- 
ing the 3' end of the genes of interest to the 5' end of the 1.3S gene (i.e. the 1.3S gene is downstream of the 
gene of interest), yielding hybrid recombinant polypeptides having the polypeptides of interest fused to the N- 
tenminus of the 1 .3S polypeptide. 
10 The PCT WQ JjO/l 4 431^d Cronan fusions are consistent with the teachings of Murtif and Samols (J Biol 
Chem 262:11813-11816 (1987)) who teach the fys jpn_gf ttje 3' end of the gene of interes t to the5!_end of the 
13Sjaene^ttTe^bjgn ginuscf^ to avoid interfering with the attachment of biotin to its binding 

domain. Murtif and Samols (Ibid) teach that the conformation of the COOH tenminus of the 1 .3 S polypeptide, 
and the spatial relationship between this region and a lysine residue positioned exactly 35 residues from the 
15 COOH terminus position to which biotin is attached in vivo, are essential for proper enzymatic recognition and 
biotinylation of the 1 .3S polypeptide. 

Murtif and Samols (Ibid) further teach that the conformation of the cart>oxyl terminal ri^ion of the 1 .3S poly- 
peptide is critical for t)iotinylation, and that altering the hydrophobe carboxyl terminai region of the 1 .3S 
polypeptkle "elimiriates bipt]^^ ~~ 
20 Murtif Samols did ok>serve biotinylation of 1.3S polypeptides, each lengthened by two amino acids at the 
1.3S cart>oxyl terminus. However, such additions of two amino acids to the C-terminus did not substantially 
change its hydrophobicity and such small additions would not be expected to change the conformation of the 
C-terminus. 

US-A-4782137 discloses a series of recombinant DNA techniques for preparing a hybrid polypeptide con- 

25 sisting of an identification peptkJe and a desired functional protein. The identification peptide has an antigenic 
terminal and a cleavable linking portion disposed between the antigenic terminal portion and the protein fno\~ 
ecule. TTie I inked{l inking portion of the kjentification peptide Is cleavable at a specific amino acid residue ad- 
jacent the functional protein by use of a sequence specific proteolytic enzyme or chemical agent. When the 
protein is cleaved from the isolated hybrid polypeptide, mature functional protein in a highly purified and active 

30 state is released. As with Murtif and Samols (Ibid), PCT WO 90/1 4431 and Cronan (ibid), US-A-47821 37 only 
discloses the use of a desired functional protein "upstream" of an identification protein. 

US-A-4839293 disbses a process for preparing a fused gene consisting of a sfreptavidin gene fused to a 
gene encoding the human LDL receptor. Methods are also disclosed that utilise the fused gene to produce lat>- 
elled, chemically nnodified proteins in vivo and also to isolate a protein knowing only the nucleotide sequence 

35 of the gene encoding the protein. The fused gene comprises a first DNA fragment encoding a target protein of 
Interest fused to a second DNA fragment encoding streptavidin which has a multiplicity of binding sites for bk)tin 
or a biotin derivative. The fused gene of US-A-483d293 is capable of expressing a fused protein in vivo when 
the gene is inserted into a suitable expression vector and introduced into a suitable host cell. The fusion proteins 
are separated by the addition of bk)tin. Apparently, this method overcomes the diadvantages associated with 

40 the then-known commercial preparations of streptavkJin as it utilises a biotin contaminant-free source of strep- 
tavidin which has all four valencies free for biotin binding. 

However, as with Murtif and SanK>ls (ibid), PCT WO 90/14431, Cronan (ibid) and US-A-4782137, US-A- 
4839293 only discloses the use of a desired functional protein "upstream" of an identrficatnn protein. 

The problenos associated with the prior art methods can therefore be summarised as follows. First, they 

45 do not allow proteins and the like to be isolated in a high level of purity. Second, they do not allow proteins arYd 
the like to be isolated in high yields. Third, they do not allow proteins and the like to t>e separated in high purity 
and yield in a single chromatographic step. Fourthly, they do not provide for the alteration of the cart>oxyl ter- 
minal regbn biotin-binding pdypeptkle part of the hybrid polypeptides. In fact, and as discussed above, the 
prior art teachings (such as those of Murtif and Samols (ibid), PCT WO 90/14431 and Cronan (ibid)) clearty 

50 suggest that altering, for example, the hydrophobicity of the cart>oxyl terminal regk>n of the 1.3S polypeptide 
would eliminate bk>tinylatk>n and therefore prevent the isolatk>n of the hybrid pdypeptkJe. 

The solution to these problems, which the present invention provides, is a recombinant hybrid polypeptide 
(including methods ar>d nudek: ackJ sequences for producing same) comprising a polypeptide of interest fused 
to an avkj in-binding polypeptkie containing a biotin attachment domain wherein tfie polypeptide of interest is 

55 fused to the C terminus of the avklin-binding polypeptide (i.e. the polypeptide of interest (i.e the desired funo- 
tional protein) is "downstream" of the avkJin- binding polypeptkie (i.e. the klentification protein)). 

Thus, according to a first aspect of tfie present inventk>n there is provided a recombinant hybrid polypeptide 
comprising a polypeptide of interest fused to an avklin-binding polypeptkie containing a biotin attachment do- 
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main, characterised in that the polypeptide of Interest is fused to the C tenninus of the avidirhbinding polypep- 
tide. 

Preferably, biotin is attached to the avid in-binding polypeptide. 

Preferably, the polypeptide includes a cleavage site for cleaving the polypeptide of interest from the avidin- 
5 binding polypeptide. This cleavage site may be between the polypeptide of interest and the avidin-binding poly- 
peptide or it may be integral with either or both of the polypeptide of interest and the avidin-binding polypeptide. 

Preferably, the deavage site is aspartic acid- proline, asparagine-glycine, methionine, cysteine, lysine-pro- 
line, arginine-proline, lysine-argintne or isoleucine-glutamic acid-glycine-arginine. 

Preferably, the avidin binding polypeptide is, or is part of, a 1.3S polypeptide. Preferably, the 1.3S pdy- 
10 peptide is from Propionlbacterium. 

Preferably, the biotin attachment domain of the avidin-binding polypeptide comprises at least one of the 
sequence: 

15 Pro Ala Pro Leu Ala Gly Thr Val Ser Lys lie Leu Val Lys Glu 

Gly Asp Thr Val Lys Ala Gly Gin Tlu- Val Leu Val Leu Glu 
Ala Met Lys Met Glu Thr Glu He Asn Ala Pro Thr Asp Gly. 

20 

Preferably, the avidin-binding polypeptide comprises a plurality of non-contiguous and / or contiguous avi- 
din-bir>ding polypeptides, which may be the same or different 

Preferably, the polypeptide of interest comprises a plurality of non-contiguous and /or contiguous polypep- 
tides of interest, which may be the same or different. 
25 Preferably, the polypeptide of interest is an enzyme. Is an antigen useful for vaccine production, or a di- 
agnostic reagent. 

Preferably, the polypeptide of interest has antitumor activity or has an amino acid sequence for recognition 
of antigens. 

According to a second aspect of the present invention there is provided a nucleic acid sequence coding 
30 for a hybrid polypeptide comprising a polypeptide of interest and an avidin-binding polypeptide containing a 
biotin attachnrwnt domain, characterised in that the nucleic acid sequence coding for tiie polypeptide of interest 
is downstream of the nucleic acid sequence coding for the avidin-binding polypeptide. 
Preferably, the nucleic acid sequence is a DNA sequence. 

Preferably, the DNA sequence contains in a 5' to 3' direction on the coding strand a gene comprising a 5' 
35 promoter region, a DNA sequence coding for the avidin-binding polypeptide and the DNA sequence coding for 
a polypeptide of interest 

Preferably, the DNA sequence is, or is part of, an expression vector or a plasmid. 
According to a third aspect of the present invention there is provided a process for the production of a hybrid 
polypeptide according to the first aspect of the present invention comprising constructing a plasmid containing 
40 a nucleic acid sequence according to the second aspect of the present invention, transfonming the plasmid into 
a procaryotic or eucaryotic host cell expression system, expressing the system, contacting the hybrid polypep- 
tide resulting from the expression system with avidin, and harvesting the resulting avidin-bound hybrid poly- 
peptide. 

Preferably, the expression system is either E. Coli or insect cells. 
45 According to a fourth aspect of the present invention there is provided a process for the isolation of a hybrid 
polypeptide according to the first aspect of the present invention comprising contacting the hybrid polypeptide 
with avidin. 

PreferakHy, the avidin is monomeric avidin, tetrameric avidin or streptavidin. 

Preferably, the polypeptide of interest is cleaved from the isolated hybrid polypeptide. 
50 Preferably, the hybrid polypeptide is isolated using avidin covalently bound to a chemically inert, solid, wa- 
ter and solvent insolut>le sut>strate through a chemically stable non-hydrdyzable linking group. 

Preferably the hybrid polypeptide is isolated using avidin monomer affinity chromatography. Preferat>ly, the 
hybrid polypeptide is isolated using avidin monomer affinity chromatography disclosed in EP-A-0414785 
(90310154.1). 

55 EP-A-0414785 discloses a nwnomeric avidin polypeptide ligand and a novel and particularty efficacious 

process for isolating synthetic or natural molecules and / or biotinylated derivatives thereof, by adsorbtion of 
the rrwlecules of interest onto a novel affinity media which contains avidin fixed to a solid inert support 

According to a fifth aspect of the present invention there is provided a first Idt comprising a hybrid polypep- 
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tide according to the first aspect of the present Invention and avidin. 

According to a sixth aspect of the present invention there is provided a second kit comprising a nucleic 
acid sequence according to the second aspect of the present invention and avidin. 

According to a seventh aspect of the present invention there is provided a third kit comprising a nucleic 
5 acid sequence which codes for an avidin- binding polypeptide containing a bk)tin attachment domain and which 
is fusable to a nucleic acid sequence coding for a polypeptide of interest in order to form a hytMid nudeic ackl 
sequence according to the second aspect of the present invention and avidin. 

Preferably, the tiiird kit comprises means to fuse the nudeic ackl sequence coding for the avklin-blnding 
polypeptkle to tiie nudeic acid sequence coding for the polypeptkje of interest in order to form the nudeic ackJ 
10 sequence according to the second aspect of the present inventton. 

Preferably, any one of the kits comprises nieans to deave the polypeptkie of interest from the avkjin-binding 
polypeptkie. 

In accordance with the present inventk>n, therefore, a hytxid polypeptkle comprising one or more polypep- 
tides of interest and at least one polypeptide for attachment is produced by recombinant DNA methods. 
IS The present inventk>n also provkies a process for producing this hybrid pdypeptkJe in a procaryotic or a 
eucaryotk; protein expressk)n system. 

One of the advantages of the present invention is that it provides a means for isolating a hybrid polypeptide 
in high purity and yield. 

Another advantage is that the hytuid polypeptide according to the present invention can be recovered in 
20 high purity and high yield in a single chromatographic step, such as the avidin monomer affinity chronrtatography 
technique disdosed in EP-A-0414785. 

Another advantage of the present invention resides in the fijsk>n of the polypeptide of interest to the C- 
temiinus, and not to the N-tenninus, of the polypeptide containing the bk)tin domain to avidin (e.g. a 1.3S poly- 
peptide). 

25 In this regard, the protein expresskin level in a host cell is detenmlned by a number of factors, induding 

promoter strength and optimal initation of protein translation (see comnnentary at>ove). Promoter strength con- 
tributes to the effidency of transcription of messenger RNA. Optimizatk>n of the processes involved in the ini- 
tiation of translatk>n is important to achieving high levels of protein expression in the host cell. When polypep- 
tides of interest are introduced at the 3' terminus of the gene coding for the polypeptkje containing the biotin 

30 domain to avidin (e.g. a 1 .3S gene), no change is made to the optimal placement of the 5' terminus of the 1 .3S 
gene directiy adjacent to the promoter and 5' regulatory sequences. Thus, nnaximal expression levels in host 
cells can be achieved. 

In contrast to this, and as found with the prior art methods, If the polypeptkie of Interest is inserted between 
the promoter and 5' tenminus of, for example, the 1.3S gene additional expenminatk>n and tailoring is required 

35 to achieve maximal expressk}n levels in host ceils. 

The fusing the polypeptide of interest to the C terminus of the polypeptide containing the bk>tin domain to 
avidin (e.g. a 1.3S polypeptkle or a fragment of the 1.3S polypeptide), so that the correct conformatk>n of the 
biotin attachment regk>n may be preserved, is thus in direct contrast to the prior art methods (such as those 
disdosed in Murtif and Samds (ibid). PCT WO 90/14431, Cronan (ibkJ), US-A-4782137 and US-A-4839293, 

40 which in fact teach away from the present invention). 

Moreover, it was surprising to find that if one went against the teachings of the prior art methods (such as 
those disdosed in Murtif and Samols (ibid), PCT WO 90/14431, Cronan (Ibid), US-A-4782137 and US-A- 
4839293) and fused a polypeptide of interest to the C-terminus of the polypeptide containing the btotin domain 
to avUin, such as a 1 .38 polypeptide, the appropriate lysine reskiue of the 1 .38 polypeptide within the hytirid 

45 was indeed biotinylated. This is surprising since, if one were to fdlow the eariier teachings of Murtif and Samols, 
the addition of a polypeptide substantially longer than two amino add reskJues wouki be expected to alter the 
conformation of the 0 temninus of the 1.3S polypeptide and thus preclude bk>tiny1ation. 

It 18 also surprising to find that, contrary to the teachings of Cronan (ibid) and Murtif and Samols (ibid), when 
a polypeptide of interest is fused to the polypeptide containing the biotin domain to avklin, such as a 1.3S poly- 

50 peptide, at the C-terminus of the 1.3S polypeptide, bk>tin Is attached to the biotin-attachment don^ain of 1.3S 
polypeptide within this hybrid recombinant polypeptide. In tills regard, Cronan (ibid) and Murtif and Samols (ibid) 
teach that fuston at the O-terminus of the 1 .38 may disrupt the native hydrophobicity and thus the native con- 
formatton of ttie 1.3S polypeptide, thus Inhibiting btotinylation, and oonsequentiy inhibiting the binding of hybrid 
polypeptides to avklin. 

55 It is further surprising to find that the bk>tin group attached to the polypeptide containing the bk>tin donnaln 
to avklin, such as a 1.3S peptide, fused at Its C-terminus to another polypeptide is positioned so as to make 
the bkitin molecule avalable for binding the hybrid polypeptide to the avkjin monomer affinity resin of EP-A- 
0414785. 



6 



EP0 511 747 A1 



Furthermore, the hybrid polypeptide is selectively retained by the avidin resin and can be recovered in high 
yield and high purity. 

In the present invention, a binding domain, or recognition sequence, directs the attachment of biotin to the 
hybrid polypeptide. The blotinylated hybrid polypeptide can then be specifically selected by affinity ligand conv 
5 positions such as the avidin monomer resin of EP-A-0414785. 

The present invention therefore yields a single purrfication step that can separate the protein of interest 
from complex mbctures, such as crude bacterial lysates, with high levels of recovery in a single chromatographic 
step, thus alleviating the recovery problems inherent to muttistep purification processes. Such a combination 
of hybrid polypeptide and avidin nnonomer affinity resin would clearly confer significant advantages to the pur- 
10 ffteation of commeitrially usefiil polypeptides over existing processes. 

An avidin-blnding polypeptide for attachment is generally a polypeptide that enables the attachment of a 
hybrid fusion polypeptide to avidin. AnDong such polypeptides for attachment that may be used are those con- 
taining a recognition sequence for attachment of the prosthetic group biotin. such as the 1.3S polypeptide sub- 
unit of transcart>oxylase from Propionibacterium. 
15 In addition to using the entire sequence of the 1 .3S polypeptide as the polypeptide for attachment, other 
smaltar portions of the 1.3Spoiypeptidemay be used which direct ttie attachment of biotin, particutariy portions 
comprising all or part of amino acid rasidiies 58 through 100 (SEQ ID NO: 2). 

One or more deletions, substitutions, insertions or mutations may be made by methods well Icnown in the 
art which result in a biotinylabed 1.3S polypeptide or blotinylated fragment The nucleotide sequence coding 
20 for the 1 .38 polypeptide or fragments may be synthesized using a conmiercially avaSable DMA synthesizer in 
a manner wall known in the art 

Additionally, otiier polypeptides or portions thereof that are enzymaticaly biotinylated nr^ay also be env 
ployed. 

Unless indicated otherwise, the term "avidin" includes streptavidin. 
25 The polypeptide of interest may include two or nrK>re polypeptides of interest 
The two or more polypeptides of interest may t>e fused sequentially. 
Optionally, contiguous polypeptides of interest may be fused sequentially. 

Additionally, more than one polypeptide of interest may be present in a noncontiguous arrangement, for 
example, one polypeptide of interest may be fused to the N-terminus of the polypeptide for attachment and one 
30 polypeptide fused at the C-terminus of the san^ polypeptide for attachment 

Two polypeptides of interest may be fused to the C-tennini of two polypeptides for attachnnent arranged 
as: polypeptide for attachment 1 - polypeptide of interest 1 - polypeptide for attachment 2 - polypeptide of in- 
terest 2. 

In any of ttie arrangements disclosed here, the polypeptides of interest may be the same, or different The 
35 polypeptides for attachment may be the same, or different 

An advantage of fusing a plurality of polypeptides of interest to at least one polypeptide for attachment is 
the ability to increase ttie yield of a single polypeptide of interest present by including two copies of that poly- 
peptide within a single hybrid polypeptide, and/or to increase the number of polypeptide species that can be 
purified simultaneously by a single avidin affinity chronnatography step, if the polypeptides of interest are dif- 
40 ferent 

A cleavage amino acid or sequence of amino acids may be present between the polypeptide containing 
the biotin domain to avidin and the polypeptide of interest 

Likewise, a cleavage amino acid or sequence of amino ackis may be present between the potypeptkles of 
interest rf two or more of such polypeptides are present Such linking, or cleavage, amino acid(s) permits the 
45 separation of polypeptides at a specific site or sites on the hybrid polypeptide when it is treated with the ap- 
propriate chemical reagent or enzyme. If desired, the cleavage site is positioned adjacent the polypeptkje of 
interest so that the polypeptide of interest may be cleaved from the polypeptide for attachment. 

The hybrid polypeptide itself may contain a linking amino acid or amino ackJs for cleaving the polypeptide 
or polypeptides of interest from the polypeptkJe or polypeptides for attachment. The linking amino ackJ or amino 
50 acids are incorporated between the polypeptide or polypeptides for attachment and the polypeptkJe or poly- 
peptides of interest in such a way that one or more cleavage reactions separate each polypeptide species to 
the degree necessary for intended applications. It may not in every instance be necessary to deave all, some, 
or any of the species within a particular hybrid polypeptide. 

Amino acids that may be used to linked{ the pdypeptkie of interest to the polypeptide for attachment Include 
55 aspartic acid-proline, asparagine-^lycine, methionine, cysteine, lysine-proline, arginine- proline, isoleucine-glu- 
tamlc acid-glycine-arginine, and the like. 

The at least one polypeptide for attachment may be cleaved from the at least one polypeptide of interest 
by exposure to the appropriate chemical reagent or cleaving enzyme. 
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It should be recognized that cleavage of the polypeptide or polypeptides of interest from the polypeptide 
or polypeptides for attachment may not be necessary for every hybrid fusion polypeptide that is constructed, 
in which case a cleavage site could be incorporated, or absent. 

The polypeptide of interest can comprise substantially any procaryotic or eucaryotic polypeptide that can 
be expressed by a vector in a host cell. Among the polypeptides of interest which may be produced by such 
means are enzyn^es. such as proteases, oxidoreductases, transferases, hydrolases, lyases, isomerases or lig- 
ases. 

The present invention also contemplates the production of storage polypeptides, such as fenitin or oval- 
bumin or transport polypeptides, such as hemoglobin, serum albumin, eruloplasmin, or the like. Also included 
are the types of polypeptides that function in contractile and motile systems, for example actin and myosin or 
the like. 

The present inventton also contemplates the production of polypeptides that serve a protective or defense 
function, such as the Wood polypeptides thrombin and fibrinogen. 

Other protective polypeptides include the binding polypeptides, such as antibodies or immunoglobulins that 
bind to and thus neutralize antigens. Addittonally this invention contemplates Protein A, or the like. 

The polypeptide produced by the present invention also may encompass vanous hormones such as en- 
dorphins, human growth honmone, somatostatin, prolactin, estrogen, progesterone, thryotropin, calcitonin, go- 
nadotropin, insulin or the like. 

Other such honmnes include those that have been identified as being involved in the immune system, such 
as interleukin 1, interieukin 2, colony stimulating factor, naacrophage-activating factor, interferon, or the like. 

The present invention rr>ay be used to produce toxte polypeptides, such as ricin from castor bean or gos- 
sypin from cotton seed, and the like. 

Polypeptkies that serve as structural elements may be produced by the present invention, such polypep- 
tides include the fibrous polypeptides collagen, elastin and alpha-keratin. 

Other structural polypeptkies include glycoproteins, virus-proteins, muco-proteins and the like. 

PdypeptkJes that may be utilized as diagnostic agents, for example as nnarkers for the presence of certain 
diseases, are also contemplated by this invention. 

Additional polypeptides of interest that may be produced as hybrid polypeptides are polypeptides that may 
be used for therapeutic purposes, for example polypeptkJes with anti-tunrmr activity, polypeptides useful in vac- 
cine productfon. polypeptides having amino ackl sequences for recognition of antigens, or polypeptides which 
can function as diagnostk; reagents, and the like. 

In addition to the above-noted naturally occurring polypeptides, the present invention may be used to pro- 
duce synthetic polypeptides, defined generally as any sequence of amino acids not occurring in nature. 

Preferably, the hybrid polypeptkie is produced in procaryotic or eucaryotic cells transformed by a cloning 
vector comprising a nucleic ackJ sequence according to the present invention. The hybrid polypeptide is then 
purified away from the complex cell extract mixture by avidin affinity chromatography. A partfculariy prefen-ed 
form of avidin is avidin mononrier. 

In a prefered embodiment, an extract of transformed cells is made from cell culture or fermentation broth, 
the hybrid polypeptide is then rendered to a soluble state, and the extract is then applied to the avidin rTK>nomer 
column. The column is then washed with adequate amounts of a wash buffer to dear the column of unbound 
HDaterials. The hybrid polypeptide is then eluted from the column. 

After the hybrid polypeptide is eluted from the column, the pdypeptkJe for attachment may optfonally be 
cleaved from the polypeptide of interest with the appropriate cleavage reagent or enzyme. Passage of the 
cleaved mixture over the avidin monomer column yields a highly purified preparatk>n of the polypeptide of in- 
terest, and the polypeptide for attachment is then retained by the column. 

The avidin used to bind the btotin attached to the hybrid polypeptkie may be nronomeric or tetrameric av Win, 
or streptavidin. Avidin nronomer is the preferred form of avidin affinity medium. 

Advantages of using avidin monomer to separate the polypeptide of interest from crude cell mbctures in- 
clude reversible binding of the polypeptkie for attachment to avidin, high yield, and high purity of the desired 
polypeptkie of interest following affinity chromatography. 

The genes coding for hybrid polypeptides may be produced by recombinant DNA methods by combining 
within a DNA expressk>n vector a chin>eric gene comprising a 5' promoter region, DNA sequences coding for 
at least one polypeptide for attachment of a prosthetfc group for binding to avWin, and at least one DNA se- 
quence coding for a polypeptide of interest. 

Optionally, the chmneric gene n>ay contain at least one DNA sequence coding for a linking amino add or 
amino ackJs, that is, one or more amino ackls for cleaving a polypeptide of interest from a polypeptide for at- 
tachnnent 

Genes coding for the various types of polypeptides of interest, for example those Wentified above, may be 
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obtained from a variety of procaryoBc or eucaryotic sources, such as plant or animal cells or bacterial cells^ 

Sn^l!Zi^^ ftlSLmal material of eucaiyotfcorproca^ 

of Diocarvotic or eucaryotic cells by employing standard, well-known techniques. 

"S^X. r^-ted DNA synth-i n«y be used to obtain DNA coding for '^^^'^^"^^^ ^ 
thetic polypeptides. To enable chimeric gene expression in host cells, a vanety of natural ly-occumng and^ 
DnI expression vectors having genes coding for many different pdypeptWe molecules are now com- 
merciallyavalable from a variety of sources. w.»„o-^«ioo« Thi««i- 

The desired DNA can also be produced fiom mRNA by using the enzyme reverse transcnptese. Tha en- 
zyme permits the synthesis of DNA from an RNA template. . . J , 
^n^ordance With the present invention, once genes coding for one or r^ore desired Po'VPeP^e^^ 
teiest are isolated synthesized or otherwise obtained, said gene or genes are joined to at least one gene cod ng 
Ta ISJ^STe c^Zing a recognttion sequence for attachment of the pmsthetic group biotin. thus enabhng 

rgTnrd^r r::,^;^^;- for attachment . gen^, one ^.g f. a p.y^P«^ 
that enables the binding of a hybrid fusion polypeptide to avidin. Among such genes coding for PO«yPepMes 
Ta^^at may'be us^^l are those coding for amino acid sequences tt«t direct a^t^f ^r^^^ 
S^Lc group biotin . for example the gene for the 1 .3S polypeptide subunit of transcart)Oxylase fiom Pfo- 

"""iSrSling for a polypeptide for attachment may be one that d^ects the attachment of the prosthe«c 
group K-Tp^ularTbSotinylated preferred biotinylated polypeptide for a«achment J^^^^ jAunJ^ 
STLboxylaw Iton P«,pionlbacterium shennanB (SEQ ID N0:1). Atthough the gen* coding tor »e ertje 
n^SySfid. from PrSonit-cterium .hem«nli i. pr.fe,.Bd(8EQ ID NO: 4). ojionaiy any 

fragments is well known to those skilled in the art ^ 

The gene or genes coding for the at least one polypeptide of interest and the gene or genes cod.^^^ 

DolvpepMe or p<iypeptWes for attachment am preferably treated with the appropnate restnctran enzymes, or 
^^l^L iThave cohesive tem,ini to faclitate ligation with other elements of the chimenc gene or the 

'''' V^^S^'^^J^ expmssion vector carrying the chimeric hybrid polypeptide gene is used to transform 
the apSo^tB Srocaryotic or eucaryotic host cell. The selection of a DNA expresston vector appropriate for 
the desired host cell is well known to those skilled in the art. , 

F^^Sng the Iransfomiatton procedure, the transfomied host cells are isolated and analy»d for expras- 
sion of the hybrid polypeptide. Tfiose transfom«nts identffied as containing the hybrid pdypepbde are fiirther 
restrirtionenzyme digestton. DNA sequencing and other methods for confimiing the correctness 
of the desired oene. by methods well known to those skilled m the art 

^li^^Lanls identified as hostcellscarryingthegeneforthedes^^^ 
tiplied in culture to cause replication of the vector and high-level expresston of the hybnd polypeptide that con- 

nil'cS^- ^ -ed to iransfom, addittonally other strains of compaUble hosts for larg^scaie 

'^ro:s°i:;S^s:r:2iin^ genes or gene fragments, preparing DNA ex^sion vectors. J.ns- 
fbmlg host cells, expressing hybrid polypeptides in host cells, and identifying th<«e P^I^P^P^^^f^^f /"^ 
by J. ^brook. E.F. Fritsch. and T. Maniatis. MdecularCloning. 2nd Edition. Cold Spnr^ Hart>or ^es j «89 
and also by F.M.Ausubel. R. Brent R.E Kingston. D.M. Moore. J.G. Seidman.J^A. Smith. K. Struhl.. Eds. Cur- 
rent Protocols in Molecular Biology. Volume 1 . John Wiley and Sons. New York 989^ 

TopraparaDNAexpression vectors, variousdoning vectors may be used Aplasmid.sprrfe^.H^ 

a cosmW or bacteriophage may be used. If insect, plant or mammalian cells are used as host cells, viruses 
^rJo Z used as veJ«s. DNA expression vectors may be obtained fmm natural sources <>r ™y <;0;nP"^ 
synthetic DNA. The plasmid chosen for a particular expresston system should be compatWe wrth that h^ to 
rsIi^vedtoTrepliclnandpolypeptW^ 

for the hybrid polypeptide should possess an origin of replication recognized by the host cell. 

The DNA exp^ion vector should contain DNA sequences recognized by restricUon endonude^e en- 
zymes to deave the vector for subsequent ligation with the gene for the hybrid polypeptide v^thout inac^^^ 

J^in of replicalionorfunctions necessary for plasmid selectionfollowingtransfom^^^ 
an antibiotic resistance gene. The vector should contain restriction enzyme deavage sites that provide surtable 
tennini for joining and ligatfon of foreign genes to be inserted. . , . „ ,fh^ i„,k™i nni« 

Preferably, the DNA vector contains a single site or two unique sites tor incorporation of the hybnd poly- 
peptide gene, neither of which occurs within that gene. 
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To accommodate more than one different foreign gene possibly tenminating in different cohesive or blunt 
temiini, it would be useful for the vector to possess a large number of unique restriction enzyme cleavage sites. 

Preferably, the DNA expression vector will cause a phenotype to be expressed that will enable transfomned 
cells to be readily identified and separated from cells which do not undergo transfonmation. Such phenotypic 
5 selection genes can indude genes providing resistance to a particular antibiotic, which inhibits growth of un- 
transformed but not transfonmed cells. Such genes are widely available now and confer resistance to antibiotics 
such as ampicillin, tetracycline, streptomycin, kanamycin, and the like. 

Plasmids which contain an inserted gene that disrupts the B-galactosldase gene, such as the hybrid poly- 
peptide gene, can be identified following transformation by the inability of the host cell to reduce reagent 5- 
10 bromo-4-chloro-3 indolyl-b-D-galactopyranoskle (X-gal) in the noedia and cause the bacterial colony to develop 
a blue coloration. Such plasmids. reagents, and media are known to those skilled in the art 

Preferably. E. coli is employed as the host cell, and a plasmid is prefen^ for doning and transfomnation 
of the E. coli host The preferred plasmid is pKK223-2(PhamDada. Uppsala, Sweden). This plasmid carries 
genes for an origin of replication in E. coli, and a gene for resistance to the antibtotic ampicillin. This plasmkl 
IS also has a synthetic linker regk)n consisting of unique restriction endonuclease deavage sites to facilitate don- 
ing. This plasmkl contains the strong tac promoter, which directs high levels of transcriptton in E. coll. 

If Insect cell culture is to t>e used for production of hybrid polypeptkle, the prefen-ed plasmid Is pVL1392 
(obtained from M. Summers. University of Texas, available commerdally finom Invitrogen, Inc. San Diego CA). 
An advantage of insect cell culture is that polypeptWes requiring glycosylatk)n or other types of post trans- 
20 lational modfications induding folding with appropriate disulfide bond fDnmatk>n may be so modified in an insect 
cell expression system, whereas this manner of post-translattonal modificatfon is not perfonfned by procaryotic 
hosts. 

To prepare the chosen plasmid for insertton of the chimeric gene comprising the hybrid polypeptkle gene, 
the plasmkl Is digested with restriction endonucleases, for example BamHI or EcoRI, or any restriction enzynne 
25 or enzyme combinatk>n that deaves the plasmid at a unique site and produces cohesive 3' and 5' termini conrv 
plementary to termini of the chimeric gene to be ligated. If desired, the plasmid may be treated with two different 
enzymes to produce two different cohesive tenmini to facilitate ligation of the chimeric hybrid polypeptkle gene 
or genes in the con-ect orientation within the plasmid. Certain enzynnes which produce blunt ends may also be 
used, or linker rrK)lecules may be added to vector or foreign genes to prepare the desired cohesive termini. 
30 Such strategies and methods are well known to those skilled in the art 

When the plasmid is digested, two or more DNA fragments nnay be generated. The desired plasmkl frag- 
ment carrying the origin of replication and other genes essential to replication and identification of the plasmkl 
may be klentified and recovered by gel electrophoresis and other techniques well known in the art. 

A particulariy preferred arrangennent for the members of the hybrid polypeptkle is the locatton of the poly- 
35 peptide for attachment, most preferably the 1 .3S polypeptkle directly 3' to the promoter at the PstI site within 
the synthetk: linker of the plasmid pkk223-3. The PstI site is 3' to the promoter and to a ribosome binding site. 

It should be understood that any deletk>ns, insertrons, substituttons. or mutations which may be performed 
on the 1 .3S gene which still direct the attachnnent of btotin are contemplated within the spirit and scope of this 
Invention. Additionally, other genes or gene fragnrwnts, natural or synthetic, whose resulting polypeptides direct 
40 the attachment of btotin or lipoic acid fall within the scope and spirit of this invention. 

The 1 .3S gene is preferably constructed witti a deavage site for PstI at its 5' terminus and a cleavage site 
for BamHI at its 3' terminus, such that upon ligation, the 1.3S gene is connected in the proper reading frame 
with the tac pronrwter, a ribosome binding site is intact, and the 1 .3S gene or fragment preferably terminates 
in tiie nudeotWe sequence GAT CCA TAA CGC CTA AGC TT (SEQ ID NO: 3), or any such sequence which 
45 simultaneously provides a BamHI restrictton endonuclease deavage site and codes for the amino ackls asp- 
pro. Asp pro is the preferred sequence used as linking amino acids the cleavage of the 1.3S polypeptide for 
attachment firom appropriate polypeptides of interest 

However, if a polypeptide of interest contains within its sequence one or more asp-pro sequences, then 
optionally any other linking amino add or amino ackls not present in the polypeptide of interest may be sub- 
so stituted for asp-pro. It will be necessary to structure the gene in such instances that appropriate cohesive termini 
are created that permit ligation of tiie 1 .3S gene to the gene for the polypeptkle of interest 

The gene or genes for the at least one polypeptide of interest nrvay be isolated, syntiiesized. or otherwise 
obtained and nrK>dified at the 5' terminus so that ligation to the appropriate terminus of the gene for the poly- 
peptide for attachnrwnt is facilitated. The 3' terminus of the 1.3S polypeptkle for attachment is the preferred 
55 terminus for ligation of the gene for the polypeptide of interest in the proper reading frame. 

Furtherrrx)re. the 3' terminus of the last polypeptide of interest in sequence in the chimeric gene shouW 
preferably be prepared so that this terminus is complementary to the 5' terminus of the plasmkl vector, to fa- 
dlitate ligation to the expresston vector. 
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It is to be understood that for different chinneric genes, obtaining ttie correct orientation of ttie polypeptide 
or polypeptides of interest relative to each other and to the polypeptide or polypeptides for attachment within 
the expressbn vector employ the t>asic steps as outlined at>ove. 

Preferably, the gene for each polypeptide of interest is attached In the proper reading frame to the adjacent 
5 gene, and all adjacent termini are prepared in a manner so as to nr^ke them complementary to facilitate ligation. 

Furthermore, genes for any polypeptides of interest requiring cleavage from another polypeptide of Interest 
or from one or more polypeptides for attachment must be so constructed as to allow for the proper positioning 
of all cleavage sites so that their insertion does not result In any genes for any of the polypeptides being in an 
improper reading frame. 

10 The ligation reaction, which covalently joins fragments of DNA, is described in Sambrook, Fritsch, and Man- 
iatis (ibid), and Ausubei et al. (ibid) and Is well known to those skilled in the art 

The ligated plasmki Is ready for transformation of host cells. The preferred host is E. cdi, however, other 
bacteria, insect cells, yeast, or mammalian or plant cells may be used with a DNA expression vector appropriate 
to that particular host cell. 

15 Transformation of E. coli Is a standard procedure welt known to those skilled In the art wherein a suitable 
host strain, such as E. coli HB101 accepts, harbors, replrcates and expresses the plasmid carrying the gene 
for the hybrid polypeptkje. Transfonmation of E. coli is described by Sambrook, et al. If the host is an insect 
cell, transfection may be accomplished by a procedure such as that described by M. Summers and G. Smith, 
A Manual of Methods for Baculovlrus Vectors and Insect Celt Culture Procedures. Texas Agricultural Experi- 

20 ment Station Bulletin No. 155, 1988. 

In order to identify the host cells which are transfomned, the culture is placed in selective media containing 
an appropriate antlbk)tic. Only those cells with plasmid- borne resistance will survive. 

Plasmkl can be recovered after lysis of surviving cell cok>nies, and characterized by restriction enzyme 
digestion and mapping, DNA sequencing, or other methods known in the art. Additionally, those colonies which 

25 express hybrid polypeptide can be kientrfied by immunological assay, such as ELISA or Western blotting. In 
some emt)odiments It may be possible to assay directly for bk>logk^l activity of the polypeptide or polypeptides 
of interest 

Once transformed cells carrying the hybrid potypeptkJe are klentrfied, they may be multiplied by established 
techniques, such as fermentation. In addition, the recovered plasmkis can be used to transform other strains 
30 of bacteria, or appropriate hosts cells for large-scale production and expressk)n of the hybrid polypeptkje. 

The hybrid polypeptide which contains the polypeptide for attachment the biotin group for binding to avidin, 
the polypeptide of interest and tiie optional cleavage site, expressed by the transformed host cells may be 
separated from the nnedium and other debris by affinity chromatography. 

The prefisrred affinity medium is the avidin monomer resin descrit>ed in EP-A'0414785. To this end, host 
35 cells are separated from the medium and broken open for example, by sonication. 

Optionally, hybrid polypeptides can t>e excreted into the culture media if a signal peptide for extracellular 
secretion is included at the appropriate terminus of the hybrid polypeptide. 

Should such a secreted polypeptide be desired, it may be necessary to include a DNA sequence coding 
for a polypeptide directing extracellular secretion within the chin>eric gene coding for the hybrid polypeptide. 
40 The hybrki polype ptkie once released is maintained in an appropriate huffier, preferably one in which It is 

soluble. The buffer solution should be formulated to maximize hybrid polypeptide recovery from host cells. Buf- 
fer properties whksh may t>e optimized to favor recovery include but are not limited to, pH, ionic compositkm, 
k)nlc strength, or presence or absence of various detergent compositions. 

Optionally sonr>e fractionation of tiie host cell extract nrmy be performed in order to concentrate or partially 
45 purify the hyt>rid polypeptkje prior to affinity chromatography. 

One preferred metfiod is amnDonium sulfate fractnnation. It is to be understood that other methods conv 
monly employed In protein purification may also be used prior to affinity chromatography. The cell extract is 
passed over tiie preferred column for affinity chromatography, the avidin nfK>nomer column, which is then wash- 
ed extensively with buffer to remove all unbound materials. 
50 The hybrid polypeptide is specitically eluted from the column, preferably with acetic acki or biotin. As as 
result a high yiekJ of highly purified hybrid polypeptide containing the polypeptide of interest is obtained. 

It may be deslrat)le or necessary to cleave the one or more polypeptkje of interest from the one or nnore 
polypeptide for attachment to restore bk)logical activity to the polypeptkle of interest 

Separation from the polypeptide for attachment may be accomplished by first suspending the hybrid poly- 
55 peptide in buffer. Thereafter tiie chemical or proteolytic cleavage agent specific to the linking amino acid or 
amino acids is added to the suspension and tiie polypeptkJe of interest is cleaved. 

For example, rf the polypeptide of interest Is linked to the polypeptide for attachment by an asp-pro linkage, 
a volatile acid such as fonmic acid nr>ay be added to tiie suspension to effect cleavage. 
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If methionine is the linking amino acid, the reagent cyanogen bromide may be used to cleave between me- 
thionine and the first amino acid of the polypeptide of interest 

A volatile cleavage reagent, such as fomiic acid or cyanogen bromide, may be evaporated away from the 
polypeptide mbcture. If deavage is accomplished by an enzyme, the enzyme may be removed from the mixture 
5 by passing the mixture through an enzyme substrate column. 

If it is necessary to obtain the polypeptide of Interest pure from the polypeptide for attachment, this renDoval 
may be accomplished by passing the mbcture through an avidin affinity column. In this way, the polypeptide for 
attachment, by binding to the avidin, will be retained and therefore separated from the highly purified solution 
of the polypeptide of interest, which will not bind to avidin. 
10 It should be noted that some polypeptides of interest will assume their desired biological activity with the 
polypeptide for attachment still attached. As a consequence, the polypeptide for attachment will not need to 
be cleaved from the polypeptide of interest and the steps descrit>ed to separate the polypeptide of interest and 
the polypeptide for attachment need not be perfonrned. Moreover, in circumstances where the polypeptide for 
attachment rennains attached to the polypeptide of interest, linking amino ackl or amino ackis may be present 
15 or omitted. In this situatk>n, the constmctk)n and method of preparing the DMA expression vectors, detaied 
above, can be appropriately modified. 

The present invention will now be described by way of examples only. 

Reference shall be made to the following figures, in which: 

Figure 1 shows a partial restrictk>n nr>ap of plasmid ptac1.3dp; 
20 Figure 2 shows chimeric gene constructs for hybrid poiypeptkjes constructed so that the polypeptkJe of 

interest is fused at the C-tennriinus of the polypeptkJe for attachment; 

Figure 3 shows chimeric genes for hybrid polypeptides containing more than one polypeptide of interest 
fused to a single polypeptkle for attachment; and 

Figure 4 shows chinDeric gene constructs for a hybrid polypeptide containing two noncontiguous polypep- 
25 tides of Interest, each fused to the C-terminus of noncontiguous polypeptides for attachment. 

Referring to Figure 1, a partial restriction map of plasmid ptac1.3dp is shown. ptac1.3dp was created by 
nfK>dificatk>n of plasmids ptad.St and ptad. 3(1-1 25) obtained from D. SanK)ls, Case Western Reserve Uni- 
versity. An E. Cdi strain CSH26 containing the plasmkJ ptac1.3dp has been deposited in the American Type 
Culture Collection, Rockvitle, Md, USA as ATCC No. 68937. This deposit was made pursuant to the Budapest 
30 Treaty On The Intemattonal Recognition Of The Deposit Of Microorganisms For The Purposes Of Patent Pro- 
cedure. 

Referring to Figure 2, chimeric gene constructs for hybrid polypeptkJes are shown. The chimeric genes are 
constructed so that the polypeptkle of interest is fused at the C-terminus of the polypeptide for attachn>ent In 
these examples, the polypeptide of interest is a synthetic t>-endorphin, and the polypeptide for attachment is 
35 the 1 .3S polypeptkle from transcarboxylase of Propionibacterium shermanii. 

In Figure 2A is shown an asp-pro cleavage site located t>etween the C-terminus of the 1.3S polypeptide 
and the N-terminus of the t>-endorphin polypeptkle. 

In Figure 2B is shown an asp-pro cleavage site located between the C-tenminus of the 1.3S polypeptide 
and the N-terminus of a novel reverse-endorphin polypeptkle. 
40 In Figure 2C is shown a methionine cleavage site located between the C-tenninus of the 1 .38 polypeptide 

and the N-tenfninus of the b-endorphin polypeptkle. 

In Figure 2D, no cleavage site is located t>6tween the C-terminus of ttie 1.3S polypeptide and the N-ter- 
minus of the b-endorphin polypeptide. 

Referring to Figure 3, chimeric genes for hybrid polypeptides are illusfrated whbh contain more than one 
45 polypeptkle of interest fused to a single polypeptkle for attachment 

In Figure 3A is shown the fusk)n of two contiguous b-endorphin polypeptides to the C-tenminus of the 1.3S 
polypeptkle firom transcarboxylase of Propionibacterium shennanii. 

In Figure 38 is illustrated the fusion of two different noncontiguous polypeptkles of interest to the 1 .38 poly- 
peptide. A maltose binding protein is fused to the N-terminus of the 1 .38 polypeptkle, and a synthetic b-endor- 
50 phin is fused to the C-terminus of the same 1 .38 polypeptide. 

Figure 3C shows the fijsk)n of two different contiguous polypeptides of interest to the N-terminus of the 
1.38 polypeptkle. 

The maltose binding protein and a synthetic b-endorphin polypeptkle are fused in tandem to the N-terminus 
of the 1.38 polypeptide. 

55 Referring to Figure 4, chimeric gene constructs for a hybrid polypeptkle containing two noncontiguous poly- 

peptides of interest are shown. Each coding sequence is fused to the C-terminus of a noncontiguous polypep- 
tides for attachment In the specific example, two synthetk: t>-endorphin polypeptides are each fused to the C- 
temninus of a different 1 .38 polypeptkle. 
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The following examples are carried out using one or more of the general procedures set forth t>elow. 
In all of the following examples, restriction endonudeases, ligases, polymerases, and other DNA nrxxiifying 
enzymes described In specific experimental steps are used according to the recommendations of the manu- 
facturer of the particular enzyme or reagent used. 
5 Two lat>oratory manuals, Current Protocols in Molecular Biology (Ausubel et al 1989)), and Molecular Clon- 

ing A Laboratory Manual 2nd Edition (Sambrook et al, Cold Spring Harbor Press, Cold Spring Harbor, NY 
(1989)) referenced below, contain supplemental information that nrtay also be useful to one skilled in the art In 
conducting the examples descrit>ed below. 

10 Procedure I. Restriction Endonudease Digestion of DNA 

Restriction endonudease digestions using one or more restriction enzynrtes to digest DNA are generally 
carried out using the protocols set forth in Ausubel etal. Current Protocols in Molecular Biology Volume 1 , Chap- 
ter 3, Unit 3.1. 

15 Restriction mapping of plasmids is generally carried out using the protocols set forth in Ausubel et al (ibki), 

Unit 3.2. Restriction enzymes are obtained from Promega (Madison Wl) or New England Biolabs (Beverly MA) 
and complete or partial digestion of DNA with specific enzymes are performed generally accordirig to the man- 
ufacturer's recommendations . 

20 Procedure il. Purification of DNA Fragments Using Agarose Gei Eiectorphoresis. 

Agarose gel electrophoresis is generally canried out using the protocols set forth in Ausubel et al (ibkj). 
Volume I, Chapter 2, Unit 2.5A. Separation and isolation of larger (>1 kb) DNA fragments from excised gel frag- 
ments is generally carried out as set forth by Ausubel et al (ibid). Chapter 2, Unit 2.6, and separation and iso- 
25 lation of smaller (<1 kb) DNA fragments Is carried out as set forth In Ausubel et al (ibid) Chapter 2, Unit 2.7. 
Removal of salts and gel fragments firom larger DNA fragnDents is accomplished using the GeneClean kit 
(Biol 01 , Inc. San Diego, CA) using procedures supplied by the manufacturer, and rennoval of salts and gel frag- 
ments from smaller DNA fragnDents is accomplished using the MerMaid kit (Biol 01, Inc. San Diego, CA) also 
using the procedures reconwnended by the manufacturer. 

30 

Procedure ill. Ligation of DNA Fragments. 

Ligatk)n of DNA fragments using T4 DNA Ligase (New England Biolabs Beverly MA) is generally carried 
out as in Ausubel et al (ibid). Unit 3.14, using conditions recommended by the manufacturer. 

35 

Procedure IV. Preparation of Competent E coii Celis and Transformation of E. coii. 

Preparatk>n of competent E. coii CSH26 cells using calcium chloride and transfonmatk>n with DNA expres- 
sion vectors are carried out using the protocols described by Sambrook et al Molecular Cloning A Laboratory 
40 Manual 2nd Edition 1 989, Chapter 1 . 

Clones carrying plasmkis containing DNA insertions are identified by growing cells on L-agar supplemented 
with 100 mg/L ampicillin. 

Procedure V. isoiatlon of Plasmid DNA. 

45 

Plasmid DNA Is generally usubel et al (ibid) Chapter 1, Unit 1.7. 
Procedure Vi. Preparation of Doubie-stranded DNA from Synthetic Single-stranded Ollgonucieotides. 

50 Ollgonucleotkles are synthesized using standard phosporamidlte chemistry (0.2 micromde synthesis). 

Fragnr>ents are separated on a 20% polyacrylamide gel under denaturing condltk)ns as described by Ausubel 

et al (ibid) Volume 1, Unit 2. 12 and eluted and desalted as described therein. 

Double-stranded DNA is assembled firom upper strand and lower strand pairs of synthetic oligonucleotides 

with overiapping regions of perfect complimentarity of 15 nucleotides at their 3' ends by heating a mixture of 
55 1 ug of each of the strands at 90°C for 5 minutes, followed by slow cooling to room temperature over a period 

of one to two hours. This short duplex region serves as template and primer for mutually primed synthesis of 

a complete double DNA strand with Sequenase, a T7 DNA polymerase obtained from US Biochemical, using 

protocols supplied by the manufacturer. 
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Following duplex extension, the DNA double strand is purified by agarose get electrophoresis as described 
in Procedure II. 

Procedure VII. Preparation of Crude E. coll Ceil Extract 

5 

E. coli host cells hart>oring the plasmid carrying the gene for the hytMid polypeptide are grown to stationary 
phase by overnight incubation In L-broth containing 100 mg/L ampicilin at 42^C, 250 RPM in a New Brunswick 
incubator-shaker. 

Celts are collected by centrifugatbn In a GSA rotor at 5,000 X G for 30 PDinutes at 4*'C In a Sorvall RC-5B 
10 centrHuge to pellet the celts. The supernatant is poured off. and the cell pellet is weighed. The cells are resus- 
pended in 1:2 (cetl wet weight buffer volume) 100 mM potassium phosphate buffer, pH 6.8-7.2 (or appropriate 
buffer at pH 4.0 to 1 1 .0). at 4**C. 

The celts are then lysed by sonicatk>n using a large probe on Fisher sonic dismembranator Model 300 for 
3 one minute cycles at 95% relative output. The resulting lysate is centrifuged at 17,500 RPM for 30 minutes 
15 at 4''C. To the resulting supernatant is added 2% (w: v) streptomycin sulfate. After incu batk>n for 1 5 to 30 minutes 
at 4*^0, the lysate Is centrifuged at 17,500 RPM as at>ove. The resulting supernatant Is adjusted to 30% satur- 
ation with ammonium sulfate, incut>ated for 30 minutes at 4^0, and centrifuged at 17,500 RPM as above. The 
supematant is adjusted to 60% saturation with ammonium sulfate, incubated for 30 minutes at 4*^0, then cen- 
trifuged at 17,500 RPM as above. 
20 The pellet formed by the addition of 60% ammonium sulfate is resuspended in 1 00 mM potassium phos- 
phate buffer. ptH 7.0 to yield approximately 80 to 100 n>g/ml total protein, and is centrifuged at 17,500 RPM as 
atx)ve. The supematant is temied the crude extract. 

Procedure Vlil. Avidin IMonoiner Affinity Chromatography. 

25 

The crude extract is applied to a 4mm X 5 cm column packed with avidin monomer affinity resin (US serial 
no. 414.785) on a LXB HPLC system equipped with two Model 21 50 pumps. Model 21 52 controller, and a Model 
2140 spectral detector. Sample absorbance is nrK>nitored at 280 nm. The crude extract is applied in 100 mM 
potassium phosphate buffer, pH 6.8 to 7.2 (or other appropriate buffer at a pH of 4.0 to 1 1.0) at a flow rate of 

30 0.1 ml /min to a column equilibrated with the same buffer. 

After the sample is loaded, the column is washed with phosphate buffer at a flow rate of 1 ml /min until 
absorbance returned to the baseline absorbance, and all nonbound material is washed from the resin. The col- 
umn Is then equilibrated with water, followed by application of 5 ml of 2M NaCI. The column is reequilibrated 
with water. This same NaCI-water wash procedure is repeated four to five times. The sample is eluted using 

35 acetic acid or biotin. as detailed in Procedure IX. 

Procedure iX. Eiution of the Hybrid Polypeptide from the Avidin IMonomer Affinity Resin 

A. Elutk)n using acetic acid 

40 

Five ml of 10% glacial acetic acid are applied to the column. Eluted hybrid polypeptide is collected until the 
absorbance at 280 nm returns to the baseline absorbance. 

B. Elutk)n using biotin 

45 

Five ml of 10 mM biotin in 100 mM potassium phosphate buffer, pH 6.5 is applied to the column. Eluted hybrid 
polypeptide is collected until the absort>ance at 280 nm returns to the t>aseline at>sorbance. 

Procedure X. Cieavage of the Poiypeptlde of Interest from the Poiypeptide for Attachment 

50 

A. Add cleavage 

[Reference: London, M. (1977) Methods in Enzymology 47:145-149.] 

The hybrid polypeptide suspension is adjusted to 70% fonrtlc acid (vA^) and incut>ated at 40^0 for 24 to 48 
55 hours. The mixture is then freeze-dried. Highly pure polypeptide of interest is obtained by Procedure VIII. 
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B. Cyanogen bromide cleavage of methionine residues [Reference: Gross, E. and B. Witkop (1961) joumal 
of American Chemical Society 83:1510-1511 J 

The hybrid polypeptide is dissolved in 70% (v/v) aqueous formic acid at 23''C. A 50 rTK>lar excess of cya- 
5 nogen bromide is added in a small volume of 70% formic acid, with stirring. 

The mixture is incubated in the dark under nitrogen at 20-25''C for 1 6 to 24 hours. The mixture is then diluted 
with 10 volumes of water and fireeze-dried. 

Highly pure polypeptide of interest is obtained by Procedure VII 1. 

10 Procedure Xi. Separation of the Polypeptide of Interest from the Polypeptide for Attachment 

The dried polypeptide mixture is resuspended in avidin monomer column loading buffer, 100 mM phosphate 
buffer pH 6.8-7.2, or other appropriate buffer at a pH of between 4.0 and 1 1 .0. Highly pure polypeptide of interest 
is obtained by passing the cleaved polypeptide mixture over the avidin resin using the procedure descrit>ed 
IS above. 

The polypeptide for attachment is retained by the avidin monomer and the polypeptide of interest is not 
retained. The polypeptide of interest is collected In the column flowthrough. 

Procedure XII. Plasmid Expression Vectors. 

20 

Two plasmids are obtained from D. Saniols, Case Westem Reserve University. 

A. Plasmid ptac 1.3t This plasmid contains the DNA sequence coding for the 123 amino acid sequence 
of the 1.3S polypeptide of transcarboxylase firom Propionibacterium shemnanii (SEQ ID NO:2). The DHA 
coding for the 1.3S polypeptide is cloned as a 431 base pair fragment into the polylinker region of the ex- 

25 pression vector pKK223-3 as described by Murtif et al. (Proc Nat Acad Sci USA 82:5617-5621 (1985)). 

B. Plasmid ptaci .3(1-125). The plasmki ptac 1.3(1-125) is described by Murtif and SanrK)ls, J Bk>l Chem 
262:11813-11815 (1987). Like ptaci .3t, ptaci .3(1-125) also contains the 1.3S polypeptide but in addition 
has the sequence: 

^ GAT CCIA TAA CGC CTA AGC TT (SEQ ID NOS) 

at the 3' end of the 1 .3S gene that encodes a Bam HI restrictton endonudease site. This DNA additional se- 
quence codes for the lintdng amino acid sequence asp-pro at the carboxyl terminus of the 1 .3S polypeptide. 
In order to illustrate the nature of this invention and the nr^anner of practicing the same, the following ex- 
35 amples are presented. 

Example 1. Modification of ptaci. 3(1 -125) to increase hytxld polypeptide expression levels in E. coli. 

In order to increase the expression level of hybrid polypeptides produced from chimeric genes inserted into 
40 ptaci .3(1 - 1 25) from approximately 0. 1 % of total soluble cellular protein to approximately 5.0% of total soluble 
cell protein, ptad .3(1-125) was modified as follows. ptac1.3(1-125) was digested with the restriction enzymes 
Xhol and Hindlll. The desired 131 base pair (bp) fragment was obtained by agarose gel electroplYoresis. 

The vector ptac1.3t was also digested with Xhol and Hindlll using the conditions described above, and the 
4.86 kilot>ase (kb) fragment was obtained by agarose gel purification. The plasmid ptaci .3dp (Figure 1) was 
45 obtained by ligation of the 131 bp fragment from ptaci. 3(1-1 25) to the 4.86 kb fragment of ptaci. 3t. 

The ligated plasmid mixture was used to transform competent E. coli HB101. An E. coll done harboring 
ptaci .3dp was IdentiTied by restriction enzyme digestion of plasmids isolated from selected ampicillin-resistant 
E. coli cells. 

50 Example 2. Fusion of a polypeptide of interest to the C-terminus of a polypeptide for attachment with an 
acid deavage site between tt>e polypeptides 

This example descrit>es a hybrid FX)lypeptide in which a synthetic t>-endorphin pdypeptide is fused to the 
cartx)xyl terminus of the 1 .3S polypeptide. An asp-pro deavage site is incorporated between the two polypep- 
55 tides for deavage and subsequent purification of b-endorphin away from the 1 .3S polypeptide after affinity pur- 
ification using avidin monomer resin. 

The amino add sequence of a modified b-endorphin polypeptide is shown in SEQ ID N0:6 and the corre- 
sponding nucleotide sequence coding for this amino acid sequence is shown in SEQ ID NO:7. 
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The synthetic ollgonudeotides from which this synthetic gene was assembled are shown in SEQ ID N0:8 
(RHcbel) and SEQ ID NO:9 (RHcbe2). There are no internal methionine residues in the modified b-endorphin 
polypeptide. 

The BamHI endonudease deavage recognition sequence GGATCC at nudeotide positions 12 through 17 
5 at the 5' end of SEQ ID NO:6 allows the introdudon of an ATG codon at the 5' terminus of the l>-endorphin 
gene when this fragment is introduced into the BamHI site of ptac1.3dp (SEQ ID NO:5), thereby adding a me- 
thionine at the ^4-terminus of the polypeptide. 

To maximize expression in E. coli, the amino acid sequence of authentic b-endorphin was reverse trans- 
lated into a Dl^ sequence using preferred codon usages of highly expressed E. coli genes (DeBoer, H Chapter 
10 8, Maximizing Gene Expression, W. ReznikofF and L. Gold, Eds.). 

RHcbel and RHcbe2 were synthesized, annealed and filled by T7 DNA polymerase as described in Pro- 
cedure VII. 

The resulting double stranded DNA sequence (SEQ ID IMO:5) was digested with BamHI . generating 105 
bp fragment coding for the synthetic b-endorphin which was purified by agarose gel electrophoresis. 
15 The plasmid vector pUC19 (Sambrook et al, Molecular Cloning, A Laboratory Manual, 2nd Editk>n, Vol 1, 
1 989. p. 1 . 1 3) was linearized with BamHI. The 5' terminus of the linearized plasmid was dephosphorylated prior 
to ligation by incubating the digest mbcture with calf intestinal phosphatase using the protocol described by Sanv 
brook et al (ibid, Vol.1 , pp. 3.38-3.39), to minimize self-ligation of the vector. 

Pure linear plasmid was recovered by agarose gel electrophoresis, and the 1 10 bp synthetic b-endorphin 
20 gene fragment was ligated to the pUC19 plasmid, and this tigated plasmid was used to transform competent 
E.coliHB101. 

Following plasmid isolation from ampicillin-resistant clones, the recombinant E. coli cells harboring the cor- 
rect plasmid were identified by restriction enzyme digestion. This recombinant plasmid containing the gene for 
the synthetic b-endorphin was designated pUCI 9endorB3. 

25 

Cloning of synthetic b-endorphin into ptacl.Sdp. 

The b endorphin gene in pUC19endorB3 was fused to the 3' terminus of the 1.3S gene in ptac1.3dp as 
follows: a 1 05 bp b-endorphin gene fragment was generated by digestion of pUCI 9endorB3 with BamHI , and 

30 purified by agarose gel electrophoresis. 

The vector ptac1.3dp, whteh contains two BamHI sites, was partially digested with BamHI . Plasmid DNA 
cut at only one BamHI site was purified by agarose gel electrophoresis. 

The 105 bp b-endorphin gene was ligated into the BamHI site of the linearized ptac1.3dp plasmid and the 
ligation mixture was used to transform competent E. coli HB101. 

35 The ligated plasmid containing the endorphin gene in the proper orientation was identified by restriction 
enzyme analysis of plasmids isolated from ampicillin-resistant transfbmned E. coli and was designated 
ptac1.3dp:endorB3 (Figure 2A). This plasmid codes for a hybrid fusion polypetide consisting of the 1.3S poly- 
peptide fused at its carboxyl tenminus to an asp-pro cleavage sequence fused to a syntiietic b-endorphin poly- 
peptide containing a methionine residue at position 1 . 

40 A highly pure preparation of the synthetic b-endorphin was obtained by inoculation of L- broth containing 
100 mg/l ampicillin with tiie E. coli host harboring ptac1.3dp:endorB3. A crude protein extract containing the 
1.3S:b-endorphin hybrid polypeptide was obtained by following Procedure VII. Highly pure 1.3S:b-endorphin 
polypeptide was obtained by avidin monomer affinity chromatography described in Procedure VIII, using acetic 
acid to elute the purified hybrid polypeptide from the resin (Procedure IX A). 

45 Cleavage of b-endorp hin from the 1 .3S polypeptide was accom pi ished by incu bation in formic acid accord- 
ing to Part A of Procedure X, and highly pure b-endorphin was obtained by avidin monomer affinity chroma- 
tography of the cleavage mixture by repeating Procedure XI. Following add cleavage of an asp-pro linking se- 
quence, a proline residue remains at the N-terminus of the cleaved t)-endorphin polypeptide. 

Clones containing the b-endorphin gene firagnr>ent inserted in the opposite orientation from 

50 ptaci .3dp:endor63 yielded a 1 .3S polypeptide fused to a novel 21 amino acid reverse endorphin peptide joined 
by the linking amino add sequence asp-pro, The gene designated ptad .3dp:revendorB3 that encodes this nov- 
el peptide is shown in Figure 2 B. This polypeptide could be purified by avidin monomer chromatography (Pro- 
cedure VIII), eluted in high yield and high purity from the column using acetic add (Procedure IX A), 

This example further demonstrates production of a hybrid polypeptide containing a polypeptide for binding 

55 to avidin as an efficadous method for obtaining polypeptides of interest in high yiekJ and high purity. 
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Example 3. Fusfon of a polypeptide of interest to a polypeptide for attachment, with a methionine cleavage 
site between the two polypeptides. 

In this example, the synthetic t>-endorphin polypeptide is fused at its N-terminus to a methionine residue, 
5 this methionine residue being positioned at the C-temninus of the 1 .3S polypeptide, thus providing a single amn 
no acid cleavage site for separation of b-endorphin from the 1 .33 polypeptide following avidin nwnomer chro- 
matography. 

Cleavage with cyanogen bromide yields an unmodified N-terminus on t>-endorphin, as the pDethionine is 
cleaved from the N-terminus of b-endorphin. 

10 The vector ptad .3dp is digested to completion with Xhol and Hindlll, and the 1 31 bp fragment is purified 
by agarose gel electrophoresis. This 1 31 bp fragment Is subjected to partial digestion with Sau3A, and the 11 0 
bp fragment so generated is isolated and purified by agarose gel electrophoresis. This Xhol-Sau3A fragment 
Is ligated to a double-stranded synthetic DNA fragment (SEQ ID N0:12) coding for b-endorphin. This b-endor- 
phin has a nwthionine residue at its N-temiinus. This fragment in SEQ ID N0;12 is assembled from synthetic 

15 oligonucleotides SEQ ID NO: 10 and SEQ ID NO: 1 1 as described in Procedure VI. and is digested to completion 
with Sau3A prior to ligation to the 1 1 0 bp Xhol-Sau3A fragment. TTie 220 bp product of this ligation Is purified 
by agarose gel electrophoresis. 

The vector ptad .3dp is subjected to partial digestion with Xhol and BamHI . and the 4886 bp linear vector 
is purified by agarose gel electrophoresis. The ligated plasmid is used to transform competent E. coll CSH26, 

20 prepared according to Procedure IV. Plasmids are isolated from transfonmed ampicillin-resistant E. coli clones, 
and a plasmid containing the desired gene in the correct orientation is identified by restriction enzyme analysis 
and designated ptad .3dp:metendor (Figure 2C). 

A highly pure preparation of the synthetic t>-endorphin is obtained by inoculation of L-broth containing 100 
rr>g/! ampicillin with the E. coli host harboring ptad .3:metendor. A crude protein extract containing the 1.3S:b- 

25 endorphin hybrid polypeptide is obtained by following Procedure VII. 

Highly pure 1.3S: b-endorphin polypeptide is obtained by avidin monomer affinity chromatography descfh 
bed in Procedure VIII. using elution with acetic acid to elute the purified hybrid polypeptide from the resin (pro- 
cedure VIIIA). aeavage of b-endorphin from the 1 .38 polypeptide was accomplished by incubation in cyanogen 
bromide according to Part B of Procedure X, and highly pure b-endorphin is obtained by avidin monomer affinity 

30 chromatography of the cleavage mixture by repeating Procedure XI. 

Example 4. Fusion of a polypeptide of interest directiy to the carboxyl terminus of a polypeptide for attach- 
rrtent. with no linking amino acid sequence being present in the hybrid polypeptide. 

55 In this example, a syntiietic b-endorphin polypeptide is fused directly to the C-tenminus of the 1.3S poly- 

peptide. Avidin monomer chromatography is used to obtain highly pure b-endorphin in the form of a hyt>rid fu- 
sion polypeptide. 

The construct ptac1.3t (Procedure XII) is digested to completion with Xhol and Hindlll, and ttie smaller 131 
bp fragment is purified by agarose gel electrophoresis. This 131 bp fragn^ent is subjected to partial digestion 
40 with Sau3A I, and the 110 bp fragment is agarose-gel purified. The 110 bp fragment is ligated to the double 
stranded DNA fragment SEQ ID NO: 15. 

SEQ ID NO:1 5 encodes a synthetic b-endorphin gene with no DNA coding for a linking amino acid or amino 
acid sequence at its 3' terminus. 

SEQ ID NO:15 is assembled from synthetic oligonucleotides SEQ ID NO:13 and SEQ ID NO:14 using Pro- 
45 cedure VI. 

Prior to ligation to the 110 bp fragment, SEQ ID NO:15 is digested to completion with Sau3A I. The 217 
bp ligation product is purified by agarose gel electrophoresis. 

Vector ptad .3dp is linearized by partial digestion witti Xhol and BamHI . and the 4886 bp fragment is also 
purified using agarose gel electorphoresis. 
50 The ligated plasmid is used to transform competent E. colt CSH26. 

Recombinant plasmids are isolated from ampicillin-resistant transfonmants, and a done containing tiie de- 
sired gene in the conrect orientation is identified by restriction enzyme analysis and designated ptac1.3:endor 
(Figure 2D). 

A highly pure preparation of the synthetic b-endorphin is obtained by inoculation of L-broth containing 1 00 
55 mg/1 ampicillin with ttie E. coli host harboring ptac1.3:endor. 

A crude protein extract containing the the 1 .3S:b-endorphin hybrid polypeptide is obtained by following Pro- 
cedure VII. 

Highly pure 1.3S:b-endorphin polypeptide is obtained by avidin nrwnomer affinity chromatography as de- 
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scribed in Procedure Vlll. using elution with acetic acid to elute ttie purified hybrid polypeptide flrom the resin 
(procedure VIIIA). 

Example 5. Fusion of two polypeptides of interest to the C-tenmlnus of a single polypeptide for attachment in 
5 the order, polypeptide for attachment: polypeptide of interest: polypeptide of interest. 

In this example, tvw> b-endorphin polypeptides in tandem are fused to the C-terminus of one 1 .3S polypep- 
tide, with the linking amino acids asp-pro-nr»t separating the fffst b-endorphin polypeptide from the C-terminus 
of the 1 .3S polypeptide and another sequence of asp-pro-met separating the first b-endorphin from the second 
10 b-endorphin polypeptide. Such a fusion doubles the yield of the polypeptide of interest, at the same tinr^ pro- 
viding a means for purification of that polypeptide by avidin affinity chromatography. 

The plasmid ptac1.3dp (Figure 1) is digested to completion with Xhol and BamHI. The resulting 118 bp 
fragment that encodes for the sequence beginning at amino acids 8 to the asp-pro site generated by the BamHI 
site at the 3' terminus of the 1.3S polypeptide as found in SEQ ID NO:5. This fragment is purified by agarose 
15 get electrophoresis. 

Two tandem b-endorphin polypeptides are generated by the creation of the double-stranded DNA frag- 
ments SEQ ID N0:18. assembled from oligonudetides SEQ ID NO:16 and SEQ ID N0:17; and SEQ ID NO:21, 
assembled from SEQ ID N0:19 and SEQ ID NO:20. 

SEQ ID NO:18 and SEQ ID NO:21 were assembled from their respective oligonuleotides using the syn- 
20 thesis and strand assembly strategy described in Procedure VI. 

SEQ ID N0:18 and SEQ ID NO:21 are digested with BamHI and ligated. 

This fragment codes for two b-endorphin polypeptides in tandem, separated from each other by an asp- 
pro cleavage sequence. The dimeric product of ligation is purified by agarose gel electrophoresis, and this frag- 
ment is ligated to the 118 bp Xhol-BamHI 1.3S partial coding sequence obtained from ptac1.3dp. This ligation 
25 product is puriTied by agarose gel electrophoresis, and is ligated to a 4886 bp fragment generated by partial 
digestion of ptaci .3dp linearized by partial digestion with Xhol and BamHI. 

Plasmid DNA is isolated from transformed E. coli HB101. Plasmids containing the correct chimeric gene 
orientation are confirmed by restriction endonudease mapping and designated ptaci .3: endorendor (Figure 
3A). 

30 A highly pure preparation of synthetic b-endorphin is obtained by inoculation of L- broth containing 1 00 mg/l 

ampicillin with the E. coll host hart>oring ptaci. 3:endorendor. 

A crude protein extract containing the 1.3S:b-endorphin: b-endorphin hybrid polypeptide is obtained by fol- 
lowing Procedure VII. Highly pure 1.3S:b-endorphin: b-endorphin polypeptide is obtained by avidin nr>onomer 
affinity chromatography described in Procedure Vltl, using acetic acid to elute the purified hybrid polypeptide 
35 from the resin (procedure VIIIA). 

Cleavage of both b-endorphin polypeptides from the 1.3S polypeptide in a single step is accomplished by 
incubation in fonmic add according to Part A of Procedure X, and highly pure b-endorphin is obtained by avidin 
n[K)nomer affinity chromatography of the deavage mbcture by repeating Procedure XI. 

40 Example 6. Fusion of one polypeptide of interest to the N-tenminus of a polypeptide for attachnr^ent and fu- 
sion of a second polypeptide of interest to the C-terminus of the san>e pdypeptide for attachment 

In this example, the maltose binding protein (Guan, C. et al.. Gene 67-21-30 (1987) and Maina, et al.. Gene 
74:365-373 (1988)) was fused to the N-tenminus of the 1.3S polypeptide and synthetic b-endorphin was fused 

45 to the C-terminus of the sanr» 1 .3S pdypeptide, thus creating a hybrid polypeptide consisting of two different 
noncontiguous pdypeptides of interest 

The construct ptaci .3dp:endor63 (Figure 2A) was d igested with Sail and Hindlll. A 438 bp fragment created 
was purified by agarose gel elecbt>phoresis. This fragment encodes amino acids 19 to 123 of the 1.3S poly- 
peptide, the asp-pro-met linlcing amino acids, and the 31 amino acid b-endorphin polypeptide. 

50 The vector pMAL-c (obtained from New England Bidabs) was linearized by digestion with Sail and Hindlll. 

This vector contains the maltose binding protein under the regulation of the tac prorTK)ter (Guan, C. etal., Gene 
67-21-30 (1987) and Maina, et al.. Gene 74:365-373 (1988)). The linearized vector and the 438 bp 1.3S-b-en- 
dorphln fragment were ligated, and the ligation mix was used to transform competent E. coli CHS26. Plasmid 
DNA was isdated, and plasmids containing the correct chimeric gene orientation were confirmed by restriction 

55 endonudease mapping. The resulting clone was designated ptac:malB:1 .3:endor63 (Figure 3B). 

A highly pure preparation of the hybrid maltose binding protein-synthetic b-endorphin polypeptide is ol^ 
tained by inoculation of L-broth containing 100 mg/l ampicillin with the E. cdi host harboring ptac:malB: 1 .3:en- 
dorB3. 
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A crude protein extract containing the hybrid polypeptide is obtained by following Procedure VII. 

Highly pure maltose binding protein:1.3S:b- endorphin hybrid polypeptide is obtained by avidin mononner 
affinity chromatography described In Procedure VIII, using biotin to elute the purified hybrid polypeptide from 
the resin (Procedure VIIIB). 

5 Biotin Is removed from the polypeptide suspension by dialysis against three changes of 1 00 mM ammonium 

carbonate buffer, pH 7.2, followed by freeze-drying of the sample. 

Cleavage of b-endorphin from the 1.3S polypeptide is accomplished by incubation in cyanogen bromide 
according to Part B of Procedure X, and highly pure b endorphin is obtained by avidin monomer affinity chro- 
matography of the cleavage mixture by repeating Procedure XI. 

10 The maltose-binding protein-1.3S hybrid polypeptide was recovered in highly pure form by repeating the 
biotin elution procedure described in Procedure VIIIB. 

Example 7. Fusion of two polypeptides of interest in tandem to the N-tenninus of a polypeptide for attach- 
ment, with an amino acid cleavage sequence separating the first polypeptide of interest from the polypep- 
15 tide for attachment, and a second amino acid cleavage sequence separating the first polypeptide of interest 
from the second polypeptide of interest . 

The nr>altose binding protein and b-endorphin are fused in tandem to the amino-terminus of the 1.3S poly- 
peptide. An amino acid cleavage site separates the maltose binding protein and b-endorphin, and another ami- 

20 no acid cleavage site separates b-endorphin and the 1 .38 polypeptide. 

The plasmid ptac1.3dp is digested with Hindi and Hindlll. The 332 bp fragment is purified by agarose gel 
electrophoresis. Linkers encoding a BamHI recognition sequence (CGGATCCG) are ligated to this fragment, 
and the fragment is digested with BamHI to generate a BamHI site at the 5' tenminus of the fragment The 338 
bp BamHI -Hindlll fragment so generated is purified by agarose gel electrophoresis . 

25 The DNA fragment SEQ ID NO:15 is digested with BamHI . and is ligated to the 338 bp BamHI-Hindlll nwdi- 
fied fragment from ptac1.3dp. The desired 444 bp fragment is purified by agarose gel electrophoresis. The vec- 
tor pMAL-c (New England Biolabs, Beverly, MA) is digested to a 6.1 kb fragment with Hindlll and BamHI , and 
the large fragment is purified by agarose gel electrophoresis. The 444 bp fragment and the 6.1 kb fragment 
are ligated. and the ligation mix is used to transform competent E. coli CSH26. 

30 Plasmid DNA Is isolated, and plasmids containings the chimeric gene in the correct orientation are con- 
firmed by restriction endonudease mapping, and are designated ptac:malC:endorB3: 1.3dp (Fig. 3C). The cor- 
rect recombinant plasmid codes for a fuston protein composed of a 42,000 MW nnaltose binding protein fused 
by an asp-pro-met linker to b-endorphin joined by an asp-pro linker to amino acids 1^123 of the 1.3S poly- 
peptide having an asp-pro carboxyl terminus. 

35 Highly pure maltose binding protein:b- endorphin: 1 .38 hybrid polypeptkJe is obtained by avidin monomer 

affinity chromatography described in Procedure VIII, using biotin to elute the purified hybrid polypeptide from 
the resin (Procedure VIIIB). Biotin is removed from the polypeptide suspension by dialysis against three 
changes of 100 mM amnwnium carbonate buffer, pH 7.2, followed by fireeze-drying of the sample. 

Cleavage of b-endorphin from the 1 .38 polypeptide is accomplished by incubation in fr)rmic acid according 

40 to Part A of Procedure X, and highly pure b-endorphin is obtained by avidin monomer affinity chromatography 
of the deavage mbcture by repeating Procedure XI. The maltose binding protein is recovered in highly pure form 
by elution of the maltose binding protein:1 .38 hybrid polypeptkle from the avklin nnonomer resin with btotin using 
Part B of Procedure IX. 

The hybrid polypeptkJe is purified away from the biotin by dialysis against three changes of 100 mM am- 
45 nK>nium carbonate buffer, pH 7.2, fdlowed by freeze-drying of the sample. The sample is freeze-dried, and 
reconstituted in cyanogen bromide according to Part B of Procedure X. The maltose-binding protein is recov- 
ered in highly pure forni by repeating the avidin monomer chromatography process detailed in Procedure XI. 

Example 8. Fuston of two polypeptkies of interest to the C-tenmini of two polypeptkles for attachment within 
50 the same hybrid polypeptide. 

In this example, two noncontiguous b-endorphin polypeptkles are fused to two noncontiguous 1.3S poly- 
peptides within one hybrid polypeptkJe with a deavage amino add sequence between each 1.38 polypeptide 
and the b-endorphin to which it is directly linked, producing the fusion hybrid polypeptide 1.3S:asp-pro-met b- 
55 endorphin:asp-pro:1 .3S:asp-pro-met b-endorphin. 

The vector 1 ,3dp:endorB3 (Figure 2A) is digested with Hindi and Hindlll and the 437 bp fragment is purified 
by agarose gel electrophoresis. Synthetk: DhiA linkers encoding a BamHI recognition sequence, CGGATCCG 
(New England Biolabs, Inc. Beveriy, MA) are ligated to this 437 bp fragment 
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Following ligation, this DHA is subjected to BamHl digestion to generate BamHI cohesive termini, and the 
443 bp BamHI-Hlndlll fragnnent Is purified by agarose gel electrophoresis. The ptac 1.3:endorendor (Figure 
3A) is partially digested with BamHI and Hindlll to yield a linear DNA of 5.1 kb digested at a single BamHl site 
and also at a single Hindlll site. This 5.1 kb fragment is purified by agarose gel electrophoresis, then is ligated 
5 to the 443 bp fragment. The ligatbn mbc is used to transform competent E. coli HB1 01 . 

Plasmid DNA is isolated from ampicill in-resistant transformed E. coli, and ptasmids containing the correct 
chimeric gene orientatk>n are confirmed by restnction endonudease mapping. 

The recombinant plasmid obtained by this procedure is designated ptad .3:endor 1 .3:endor (Figure 4) and 
encodes a hybrid polypeptide which permits the isolation of two molecules of b-endorphin for every hybrid poly- 
10 peptide purified, which may double the yield of polypeptide from a single fermentatk>n. 

A highly pure preparation of synthetk: b-endorphin is obtained by inoculation of L- broth containing 1 00 mg/l 
ampiclllin with the E. cdl host hart>oring ptaci .3: endor.1 .3:endor. A crude protein extract containing the 1 .3S:t>- 
endorphin:1.3S:b-endo^phin hybrid polypeptide Is obtained by following Procedure VII. 

Highly pure 1.3S: b-endorphin: 1.3S :b-endorphin pdypeptkJe is obtained by avidin nK>nomer affinity chro- 
15 matography described in Procedure VIII, using acetic acid to elute the purified hybrid polype ptkJe from the resin 
(procedure VI II A). 

Cleavage of t>oth b-endorphin polypeptides from both 1.3S polypeptides in a single step is accomplished 
by incubatk)n In formic acid according to Part A of Procedure X, and highly pure t>-endorphin is obtained by 
avidin monomer affinity chromatography of the cleavage mbcture by repeating Procedure XL 

20 
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SEP ID NO: 2 

SEQUENCE TYPE: Peptide 

SEQUENCE LENGTH: 43 

MOLECULE TYPE: Pepdde 

ORIGINAL SOURCE ORGANISM: Bacterium 

SOURCE NAME: Propionibacterium shermanii 

PROPERTIES: Biotin-binding recognition sequence 

Pro Ala Pro Leu Ala Gly Thr Val Ser Lys lie Leu Val Lys Glu Gly 

15 10 15 

Asp Thr Val Lys Ala Gly Gin Thr Val Leu Val Leu Glu Ala Met Lys 

20 25 30 

Met Glx Thr Glu lie Asn Ala Pro Thr Asp Gly 

35 40 
SEO ID NO: 3 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 20 
STRANDEDNESS: Double-stranded 
TOPOLOGY: Linear 
MOLECULE TYPE: Genomic DNA 

PROPERTIES: Termination fragment for a BamHI deavage site 
GATCCATAAC GCCTAAGCTT 
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SEP ID NO:4 



SEQUENCE TYPE: Nucleotide 



SEQUENCE LENGTH: 372 



STRANDEDNESS: Double-stranded 



TOPOLOGY: Linear 



MOLECULE TYPE: Genomic DNA 



ORIGINAL SOURCE ORGANISM: Bacterium 



SOURCE NAME: Propionibacterium shermanii 



PROPERTIES: Gene coding for 1.3S polypeptide 



ATGAAACTG A AGGTAACAGT CAACGGCACT GCGTATGACG 40 

TTGACGTTGA CGTCGACAAG TCACACGAAA ACCCGATCGG 80 

CACCATCCTG TTCGGCGGCG GCACCGGCGG CGCGCCGGCA 120 

CCGCGCGCAG CAGGTGGCGC AGGCGCCGGT AAGGCCGGAG 160 

AGGGCGAG AT TCCCGCTCCG CTGGCCGGCA CCGTXnCCAA 200 

GATCCTCGTG AAGG AGGGTG ACACGGTCAA GGCIGGTCAG 240 

ACCGTGCTCG TTCTCGAGGC CATGAAGATG GAGACCGAG A 280 

TCAACGCTCC CACCG ACGGC AAGGTCG AGA AGGTCCFIGT 320 

CAAGGAGCGT GACGCCGTGC AGGGCGGTCA GGGTCTCATC 360 

AAG ATCGGCT G A 372 



55 
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SEP ID NCk5 

5 SEQUENCE TYPE: Nudeotide 

SEQUENCE LENGTH: 390 

STRANDEDNESS: Double-stranded 
10 TOPOLOGY: Linear 

MOLECULE TYPE: Genomic DNA 

PROPERTIES: Oligonucleotide ptacl.3dp 

IS 

ATGAAACTGA AGGTAACAGT CAACGGCACT GCGTATGACG 40 
TTGACGTTGA CGTCGACAAG TCACACGAAA ACCCGATGGG 80 
^ CACCATCCTG TTCGGCGGCG GCACCGGCGG CGCGCCGGCA 120 
CCGCGCGCAG CAGGTGGCGC AGGCGCCGGT AAGGCCGGAG 160 
AGGGCG AG AT TCCCGCTCCG CTGGCCGGC A CCGTCTCCAA 200 
GATCCTCGTG AAGGAGGGTG ACACGGTCAA GGCTGGTCAG 240 
ACCGTGCTCG TTCTCGAGGC CATGAAGATG GAGACCGAGA 280 
TCAACGCTCC CACCGACGGC AAGGTCGAGA AGGTCCTTGT 320 

30 

CAAGGAGCGT GACGCCGTGC AGGGCGGTCA GGGTCTCATC 360 
AAGATCGGCT GATCCATAAC GCCTAAGCTT 390 

SEOIDNQe 

SEQUENCE TYPE: Peptide 
SEQUENCE LENGTH: 31 

40 

MOLECULE TYPE: Peptide 

PROPERTIES: Modified b-endorphin polypeptide 

45 

Tyr Gly Gly Phe Leu Thr Ser Qu Lys Ser Gin Thr Pro Leu Val Thr 
15 10 15 

^ Leu Phe Lys Asn Ala He lie Lys Asn Ala Tyr Lys Lys Gly Glu 

20 25 30 
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SEP ID NO:7 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 135 
STRANDEDNESS: Double-stranded 
TOPOLOGY: Linear 
MOLECULE TYPE: Genomic DNA 

PROPERTIES: Gene coding for modified b-endorphin polypeptide 

AAGCTTCTAG AGGATCCTAT GTACGGTGGT TTCCTGACCT 
CCGAAAAATC TCAGACCCCG CTGGTTACTC TGTTCAAAAA 
CGCTATCATC AAAAACGCAT ACAAAAAAGG CGAATAAGGA 
TCCGAATTCG AGCTC 



SEOIDNO:8 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 75 
STRANDEDNESS: Double-stranded 
TOPOLOGY: Linear 
MOLECULE TYPE: Genomic DNA 
PROPERTIES: OUgonudeotide RHcbel 

AAGCTTCTAG AGGATCCTAT GTACGGTGGT TICCIGACCT 
CCGAAAAATC TCAGACCCCG CrGGTTACTC TGTTC 
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SEP ID NQ 9 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 75 
STRANDEDNESS: Double-stranded 
TOPOLOGY: Linear 
MOLECULE TYPE: Genomic DNA 
PROPERTIES: Oligonucleotide RHd)^ 



GAGCTCGAAT TCGGATCCTT ATTCGCCTTT TITGTATGCG 
TTTTTGATGA TAGCGTTTTT GAACAGAGTA ACCAG 



SEP ID NO. 10 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 75 
STRANDEDNESS: Double-stranded 
TOPOLOGY: Linear 
MOLECULE TYPE: Genomic DNA 
PROPERTIES: Synthetic oligonucleotide 

AAGCTTCTAG AGATCGGCAT GTACGGTGGT TTCCTGACCT 
CCGAAAAATC TCAG ACCCCG CTGGTTACTC TGTTC 
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SEP ID Nail 

5 SEQUENCE TYPE: Nucleotide 

SEQUENCE LENGTH: 75 

STRANDEDNESS: Double-stranded 
10 TOPOLOGY: Linear 

MOLECULE TYPE: Genomic DNA 

PROPERTIES: Synthetic oligonucleotide 

15 

GAGCTCGAAT TCGGATCCTT ATTCGCCTTT TTTGTATGCG 40 
20 TmTGATGA TAGCGITITI GAACAGAGTA ACCAG 75 



26 SEP ID NO:12 

SEQUENCE TYPE: Nucleotide 
30 SEQUENCE LENGTH: 135 

STRANDEDNESS: Double-stranded 

TOPOLOGY: Linear 
^ MOLECULE TYPE: Genomic DNA 



AAGCrrCTAG AGATCGGCAT GTACGGTGGT TTCCTGACCT 40 

CCGAAAAATC TCAGACCCCG CrGGTTACTC TGTTCAAAAA 80 

CGCTATCATC AAAAACGCAT ACAAAAAAGG CGAATAAGGA 120 

*' TCCGAATTCG AGCTC 135 



50 
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SEP DM NO:13 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 75 
STRANDEDNESS: Double-stranded 
TOPOLOGY: Linear 
MOLECULE TYPE: Genomic DNA 
PROPERITES: Synthetic oUgonudeotide 

AAGCTTCTAG AGATCGGCTA CGGTGGTITC CTGACCTCCG 
AAAAATCTCA GACCCCGCTG GTTACTCTGT ICAAA 



SEOIDNO:14 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 72 
STRANDEDNESS: Double-stranded 
TOPOLOGY: Linear 
MOLECULE TYPE: Genomic DNA 
PROPERTIES: Synthetic oUgonudeotide 

GAGCTCGAAT TCGGATCCTF ATTCGCCTIT nTGTATCCG 
TTITTGATGA TAGCGTmT GAACAGAGTA AC 
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SEP ID NO:15 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 132 
STRANDEDNESS: Double-stranded 
TOPOLOGY: Linear 
MOLECULE TYPE: Genomic DNA 



AAGCTTCTAG AGATCGGCTA CGGTGGTTTC CTGACCTCCG 
AAAAATCTCA GACCCCGCTG GTTACTCTGT TCAAAAACGC 
TATCATCAAA AACGCATACA AAAAAGGCGA ATAAGGATCC 
GAATTCGAGC TC 



SEP ID Nai6 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 75 
STRANDEDNESS: Double-stranded 
TPFPLPGY: Linear 
MPLECULE TYPE: Genomic DNA 
PRPPERTIES: Synthetic oUgonudeotide 

AAGCTTCTAG AGGATCCTAT GTACGGTGGT TTCCIGACCT 
CCGAAAAATC TCAGACCCCG CTGGTTACTC TCTTC 
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SEP ID Nai7 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 71 
STRANDEDNESS: Double-stranded 
TOPOLOGY: Linear 
MOLECULE TYPE: Genomic DNA 
PROPERTIES: Synthetic oligonucleotide 

GAGCTCGAAT TCGGATCCTC GCCmTITG TATGCGTTTT 40 
TGATGATAGC GTTnTGAAC AGAGTAACCA G 71 



SEO ID NO: 18 

25 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 130 

30 

STRANDEDNESS: Double-stranded 

TOPOLOGY: Linear 

MOLECULE TYPE: Genomic DNA 

35 



AAGCTTCTAG AGGATCCTAT GTACGGTGGT TTCCTGACCT 40 

CCGAAAAATC TCAGACCCCG CTGGTrACTC TGTTCAAAAA 80 

CGCTATCATC AAAAACGCAT ACAAAAAAGG CGAGGATCCG 120 

AATTCGAGCTC 130 
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SEP ID NO:19 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 75 
STRANDEDNESS: Double-stranded 
TOPOLOGY: Linear 
MOLECULE TYPE: Gnomic DNA 
PROPERTIES: Synthetic oligonucleotide 

AAGCTTCTAG AGGATCCTAT GTACGGTGGT TTCCTGACCT 
CCGAAAAATC TCAGACCCCG CTGGTTACTC TCTTC 



SEP ID Na20 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 75 
STRANDEDNESS: Double-stranded 
TOPOLOGY: Linear 
MOLECULE TYPE: Genomic DNA 
PROPERTIES: Synthetic oligonucleotide 

G AGCTCGAAT TCGGATCCTT ATrCGCCTTT TITGTATGCG 
mTTGATGA TAGCGITITI GAACAGAGTA ACCAG 
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SEQUENCE LISTING 
SEP. ID NO:l 

SEQUENCE TYPE: Peptide 

SEQUENCE LENGTH- 123 

MOLECULE TYPE: Pirotein 

ORIGINAL SOURCE ORGANISM: Bacterium 

SOURCE NAME: Propionibacterium shermanii 

FEATURES: From 58 to 100 - biotin-binding recognition sequence 

PROPERTIES: 13S biotin-binding protein 



Met Lys Leu Lys Val Thr Val Asn Gly Thr Ala Tyr Asp Val Asp Val 
15 10 15 

Asp Val Asp Lys Ser His Glu Asn Pro Met Gly Thr He Leu Phe Gly 

20 25 30 

Gly Gly Thx Gly Gly Ala Pro Ala Pro Arg Ala Ala Gly Gly Ala Gly 

35 40 45 

Ala Gly Lys Ala Gly Glu Gly Glu lie Pro Ala Pro Leu Ala Gly Thr 

50 55 60 

Val Ser Lys He Leu Val Lys Glu Gly Asp Thr Val Lys Ala Gly Gin 
65 70 75 80 

Thr Val Leu Val Leu Glu Ala Met Lys Met Gbc Thr Glu lie Asn Ala 

85 90 95 

Pro Thr Asp Gly Lys Val Glu Lys Val Leu Val Lys Glu Arg Asp Ala 

100 105 110 

Val Gin Gly Gly Gin Gly Leu He Lys He Gly 
115 120 
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SEP ID NO:21 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 135 
STRANDEDNESS: Double-stranded 
TOPOLOGY: Linear 
MOLECULE TYPE: Genoniic DNA 



AAGCTTCTAG AGGATCCTAT GTACGGTGGT TTCCTG ACCT 40 

CCGAAAAATC TCAGACCCCG CTGGTTACTC TGTTCAAAAA 80 

CGCTATCATC AAAAACGCAT ACAAAAAAGG CGAATAAGGA 120 

TCCGAATTCG AGCTC 135 



Claims 



1 . A recombinant hybrid polypeptide comprising a polypeptide of interest fused to an avidin- binding polypep- 
tide containing a blotin attachment domain, characterised in that the polypeptide of interest is fused to the 
C tenminus of the iavidin-binding polypeptide. ~ 

2. A recombinant hybrid polypeptide according to claim 1 wherein biotin is attached to the avidin- binding poly- 
peptide. 

3. A recombinant hybrid polypeptide according to claim 1 or dalm 2 wherein the polypeptide includes a cleav- 
age site for cleaving the polypeptide of interest from the avidin-binding polypeptide. 

4. A recombinant hybrid polypeptide according to any one of claims 1 to 3 wherein the avidin binding poly- 
peptide is, or is part of, a 1.3S polypeptide. 

5. A recombinant hybrid polypeptide according to claim 4 wherein the 1 .3S polypeptide is from Propionibac- 
terium. 

6. A recombinant hybrid polypeptide according to any one of clainrts 1 to 5 wherein the biotin attachment do- 
main of the avidin-binding polypeptide comprises at least one of the sequence Pro Ala Pro Leu Ala Gly 
Thr Val Ser Lys He Leu Val Lys Glu Gly Asp Thr Val Lys Ala Gly Gin Thr Val Leu Val Leu Glu Ala Met Lys 
Met Glu Thr Glu lie Asn Ala Pro Thr Asp Gly. 

7. A recombinant hybrid polypeptide according to any one of claims 1 to 6 wherein the avidin-binding poly- 
peptide comprises a plurality of non-contiguous and/or contiguous avidin-binding polypeptides, which may 
be the same or differenL 



8. A recombinant hybrid polypeptide according to any one of claims 1 to 6 wherein the polyp>eptide of Interest 
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10 



comprises a plurality cf non-contiguous and/or contiguous polypeptides of interest, wtiich may be the same 
or different. 

9. A recombinant tiybrid polypeptide according to any one of claims 1 to 8 wherein the polypeptide of interest 
is an enzyme, is an antigen useful for vaccine production, or a diagnostic reagent. 

10. A recombinant hybrid polypeptide according to any one of claims 1 to 9 wherein the polypeptide of interest 
has antitumor activity or has an amino acid sequence for recognition of antigens. 

11. A nucleic acid sequence coding for a hybrid polypeptide as defined in any one of claims 1 to 10 wherein 
the nucleic acid sequence comprises a nucleic acid sequence coding for an avidin-binding polypeptide 
upstream of a nucleic acid sequence coding for a polypeptide of interest. 

12. A nucleic acid sequence according to daim 1 1 wherein the nucleic acid sequence is a DNA sequence. 

15 13. A nucleic acid sequence according to claim 12 wherein the DNA sequence contains in a 5' to 3' direction 
on the coding strand a gene comprising a 5' promoter region, the DNA sequence coding for the avidin- 
binding polypeptide and the DNA sequence coding for the polypeptide of interest 

14. A nucleic acid sequence according to daim 12 or daim 1 3 wherein the DNA sequence is, or is part of, an 
20 expression vector or a plasmid. 

15. A process for the production of a hybrid polypeptide as defined in any one of daims 1 to 10 comprising 
constructing a plasmid containing a nudeic acid sequence as defined in any one of daims 11 to 13, trans- 
forming the plasmid into a procaryotic or eucaryotic host cell expression system, expressing the system. 

25 contacting the hybrid polypeptide resulting from the expression system with avidin, and harvesting the re- 

sulting avidin-bound hybrid polypeptide. 

16. A process according to claim 15 wherein the expression system is either E. Coli or insect cells. 



30 



17. A process for the isolation of a hybrid polypeptide as defined in any one of daims 1 to 10 comprising con- 
tacting the hybrid polypeptide with avidin. 

18. A process according to any one of claims 15 to 17 wherein the avidin is nrK3nomeric avidin, tetrameric avidin 
or streptavidin. 

35 1 9. A process according to any one of daims 1 5 to 1 8 wherein the polypeptide of interest is deaved from the 
isolated hybrid polypeptide. 

20. A process according to any one of daims 15 to 1 9 wherein the hybrid polypeptide is isolated using avidin 
covalently bound to a chemically inert solid, water and soh/ent insoluble substrate through a chemically 

40 stable non-hydrolyzable linlcing group, preferably the hybrid polypeptide is isolated using avidin monomer 

affinity chromatography. 

21. A first kit comprising a hybrid polypeptide as defined in any one of daims 1 to 10 and avidin. 

22. A second kit comprising a nudeic acid sequence as defined in any one of dainns 1 1 to 14 and avidin. 

45 

23. A third kit comprising a nudeic add sequence which codes for an avkiin-binding polypeptide containing 
a biotin attachment domain and which is fusable to a nucleic acid sequence coding for a polypeptkle of 
interest in order to form a hybrid nudek; acid sequence as defined in any one of daims 1 1 to 14 and avkJin. 

^ 24. A kit according to daim 23 wherein the kit comprises means to fuse the nudeic acid sequence coding for 
the avkJ in-binding polypeptide to the nudek: ackj sequence coding for the polypeptide of interest in order 
to form the nudeic acid sequence as defined in any one of dalnris 1 1 to 14. 

25. A kit according to any one of claims 21 to 24 wherein the kit comprises means to deave the polypeptide 
55 of interest from the avidin-binding polypeptide. 
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Ptac1.3dp 



Fig. 1. 
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us 6,210,676 Bl 
21 22 



-continued 



15 10 15 

gac gtc gac aag tea cac gaa aac ccg atg ggc acc ate ctg ttc ggc 

Asp Val Aep Lys Ser His Glu Asn Pro Met Gly Thr lie Leu Phe Gly 
20 25 30 



tec aag ate etc gtg aag gag ggt gac acg gtc aag get ggt cag acc 
Ser Lys lie Leu Val Lys Glu Gly Asp Thr Val Lys Ala Gly Gin Thr 
65 70 75 80 

gtg etc gtt etc gag gee atg aag atg gag acc gag ate aae get cec 
Val Leu Val Leu Glu Ala Met Lys Met Glu Thr Glu He Asn Ala Pro 
85 90 95 



aaa aca cat aat act aaa gac aag ggt get gaa gaa ett gta aag tta 
Lys Thr His Aen Thr Lys Asp Lys Gly Ala Glu Glu Leu Val Lys Leu 
145 150 155 160 

get gaa tea gta gca ggc ttg eta aaa gta gcg caa gaa aca eta aat 
Ala Glu Ser Val Ala Gly Leu Leu Lys Val Ala Gin Glu Thr Leu Asn 
165 170 175 

aat tea gtt aaa gaa ett aca agt cct gtt gtg gca gaa agt cca aaa 
Asn Ser Val Lys Glu Leu Thr Ser Pro Val Val Ala Glu Ser Pro Lys 
180 185 190 

aaa cct taaeccgggg cggeegcg 
Lys Pro 



96 



ggc gge acc ggc ggc gcg ccg gca ccg gca gca ggt ggc gca ggc gee 144 

Gly Gly Thr Gly Gly Ala Pro Ala Pro Ala Ala Gly Gly Ala Gly Ala 

35 40 45 

ggt aag gcc gga gag ggc gag att ccc get ceg ctg gcc ggc ace gtc 192 

Gly Lys Ala Gly Glu Gly Glu He Pro Ala Pro Leu Ala Gly Thr Val 
50 55 60 



240 



288 



acc gac ggc aag gtc gag aag gtc ctg gte aag gag cgt gac gcg gtg 336 
Thr Aep Gly Lys Val Glu Lys Val Leu Val Lys Glu Arg Asp Ala Val 
100 105 110 

cag ggc ggt cag ggt etc ate aag ate ggg gat etc gag etc ate gaa 384 
Gin Gly Gly Gin Gly Leu lie Lys He Gly Asp Leu Glu Leu He Glu 
115 120 125 

ggt cgc gaa age ttc age tgg gat ccg gta eeg ata tea gat etc cca 432 
Gly Arg Glu Ser Phe Ser Trp Asp Pro Val Pro He Ser Asp Leu Pro 
130 135 140 



480 



528 



576 



600 



<210> SEQ ID NO 2 
<211> LENGTH: 194 
<212> TYPE: PRT 

<213> ORGANISM: Borrelia burgdorferi 
<400> SEQUENCE: 2 

Met Lys Leu Lys Val Thr Val Asn Gly Thr Ala Tyr Asp Val Asp Val 
15 10 15 

Asp Val Asp Lys Ser His Glu Asn Pro Met Gly Thr He Leu Phe Gly 
20 25 30 

Gly Gly Thr Gly Gly Ala Pro Ala Pro Ala Ala Gly Gly Ala Gly Ala 
35 40 45 

Gly Lys Ala Gly Glu Gly Glu He Pro Ala Pro Leu Ala Gly Thr Val 
50 55 60 

Ser Lys He Leu Val Lys Glu Gly Asp Thr Val Lys Ala Gly Gin Thr 
65 70 75 80 

Val Leu Val Leu Glu Ala Met Lys Met Glu Thr Glu He Asa Ala Pro 
85 90 95 

Thr Asp Gly Lys Val Glu Lys Val Leu Val Lys Glu Arg Asp Ala Val 
100 105 110 

Gin Gly Gly Gin Gly Leu He Lya He Gly Asp Leu Glu Leu He Glu 



us 6,210,676 Bl 
23 24 

-continued 

115 120 125 

Gly Arg Glu Ser Phe Ser Trp Asp Pro Val Pro He Ser Asp Leu Pro 
130 135 140 

Lys Thr His Asn Thr Lys Asp Lys Gly Ala Glu Glu Leu Val Lys Leu 
145 150 155 160 

Ala Glu Ser Val Ala Gly Leu Leu Lys Val Ala Gin Glu Thr Leu Asn 
165 170 175 

Asn Ser Val Lys Glu Leu Thr Ser Pro Val Val Ala Glu Ser Pro Lys 
180 185 190 

Lys Pro 



<210> SEQ ID NO 3 

<211> LENGTH; 36 

<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 

<220> FEATURE: 

<223> OTHER INFORMATION: Description of Artificial Sequence: PGR Primer 

<400> SEQUENCE: 3 

cgtggatcca tgaaaaagaa tacattaagt gcgata 



<210> SEQ ID NO 4 

<211> LENGTH: 34 

<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 

<220> FEATURE: 

<223> OTHER INFORMATION: Description of Artificial Sequence: PGR Primer 

<400> SEQUENCE: 4 

aattcccggg ttaaggtttt tttggacttt cgtc 



What is claimed is: 

1. An isolated, immunogenic polypeptide fragment of 
outer surface protein C (OspC) of Borrelia burgdorferi 
consisting essentially of an epitope of OspC having an 
amino acid sequence as shown in residues 145 to 194 of 
SEQ. ID. NO: 2. 

2. An isolated polypeptide having an amino acid sequence 
as shown in SEQ. ID. NO: 2. 

3. The isolated polypeptide of claim 2, further having an 
amino acid sequence as shown in residues 145 to 194 of 
SEQ ID NO: 2. 

4. A pharmaceutical composition to vaccinate against and 
to treat borreha infection in mammals, including humans, 
the composition comprising an amount of an isolated 
polypeptide consisting essentially of an amino acid sequence 
as shown in SEQ. ID. NO: 2, the amount being effective to 
vaccinate against or to treat borrelia infection in mammals. 

5. llie pharmaceutical composition of claim 4, further 
comprising a pharmaceutically suitable carrier. 



6. The pharmaceutical composition of claim 4, wherein 
the isolated polypeptide consists essentially of an amino acid 
sequence as shown in residues 145 to 194 of SEQ. ID. NO: 
2. 

7. Hie pharmaceutical composition of claim 6, further 
comprising a pharmaceutically suitable carrier. 

8. The pharmaceutical composition of claim 4, further 
comprising an adjuvant. 

9. The pharmaceutical composition of claim 6, further 
comprising an adjuvant. 

10. A kit for diagnosing borrelia infection in mammals, 
including humans, the kit comprising an isolated polypep- 
tide having an amino acid sequence as shown in SEQ. ID. 
NO: 2, disposed in a suitable container therefor, and instruc- 
tions for use of the kit. 

11. The kit of claim 10, wherein the isolated polypeptide 
has an amino acid sequence as shown in residues 145 to 194 
of SEQ. ID. NO: 2. 

% * « * 4c 



